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Abstract of DEI 9840771 

A thermostable in vitro complex for template-dependent elongation of nucleic acids comprises a 
thermostable sliding clamp protein, which is connected with an elongation protein that shows 
thermostable polymerase activity is new. Independent claims are also Included for the following: (1) a 
thermostable accessoiy in vitro complex characterized in that it contains a sliding clamp protein and a 
coupling protein, as defined above; (2) a recombinant DNA sequence, which encodes a thermostable 
in vitro complex or a thermostable accessory complex; (3) a vector that contains recombinant DNA 
encoding a sliding clamp protein and a coupling protein and/or an elongation protein; (4) a host cell 
transformed with one or more vectors of (3); (5) a method for the production of a thermostable in vitro 
complex or thermostable in vitro accessory complex as above; (6) a method for template-dependent 
elongation of nucleic acids; (7) a method of marking nucleic acids through generating a single break in 
the phosphodiester bond of the nucleic acid chain and substituting a nucleotide at the break site with a 
marked nucleotide with the help of a polymerase, where the polymerase is a themnostable in vitro 
complex as above; and (8) a reagent kit for elongation and/or amplification and/or reverse-transcription 
and/or sequencing and/or marking of nucleic acids. 
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Description 

The present invention relates to a thermostable in vitro complex for template-dependent 
elongation of nucleic acids, a thermostable, prokaryotic, accessory in vitro complex, as well as 
DNA-sequences and vectors for encoding it. The invention further relates to the use of the 
inventive complex in methods of template-dependent elongation of nucleic acids, such as PGR 
reactions or those of DNA sequencing involving in vitro template-dependent DNA strand 
synthesis. Finally, the present invention relates to reagent kits for implementing the inventive 
method. 

DNA polymerases are members of a group of enzymes that utilize single stranded DNA 
as templates in the synthesis of a complementary DNA strand. These enzymes play an important 
role in nucleic acid metabolism, including the processes of DNA replication, repair and 
recombination. DNA polymerases have been identified in all cellular organisms, ranging fi'om 
bacterial to human cells, in many viruses and in bacteriophages (Romberg, A. & Baker, T.A. 
(1991) DNA Replication WH Freeman, New York, NY). Generally, archaebacteria and 
eubacteria are grouped together in the group of prokaryotes, organisms lacking a true cell 
nucleus, and are contrasted with eukaryotes, organisms having a true cell nucleus. Gommon to 
many polymerases from a wide variety of organisms are often similarities in amino acid 
sequence and structure (Wang, J., Sattar, A.K.M.A; Wang, G.G., Karam, J.D., Konigsberg, W.H. 
& Steitz, T.A. (1997) "Grystal Structure of pol a familily replication DNA polymerase fi'om 
bacteriophage RB69**, Gell 89, 1087-1099). Organisms, such as human organisms, possess a 
plurality of DNA-dependent polymerases, not all of which however are responsible for DNA 
replication. Several also effect DNA repair. Replicating DNA polymerase consists for the most 
part of protein complexes having multiple subunits which replicate the chromosomes of cellular 
organisms and viruses. A general characteristics of such replicating polymerases is a generally 
high degree of processivity, that is, their ability to polymerize thousands of nucleotides without 
disassociating from a DNA template (Romberg, A & Baker, T.A. (1991) DNA Replication. WH 
Freeman, New York, NY). 

Until recently, the only well understood highly processive replication mechanisms in 
cells were those used by cellular replicases (a protein complex exhibiting polymerase activity), in 
addition to the replication apparatus of the bacteriophages T4 or T7. The apparatus underlying 
this mechanism contains a protein having a ring-like structure, a "sliding" clamp which encircles 
the DNA, and binds the catalytic polymerase unit - the "elongation" protein to the DNA 
(Stukenberg, P.T., Studwell-Vaughan, P.S. & O'Donell, M. (1991) "Mechanism of the B-clamp 
of DNA polymerase III holoenzyme", J. Biol. Ghem. 266, 1 1328-1 1334; Kuriyan, J. & 
O'Donnel, M. (1993) "Sliding clamps of DNA polymerases" J. Mol. Biol. 234, 915-925. 
Frequently, the sliding clamp is bonded via one or more additional proteins, "coupling proteins" 
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to the elongation protein. The 3-dimensional structure of various sliding clamp proteins has 
already been determined: 

- that of the eukaryotic proliferating cell nuclear antigen (PCNA) (Krishna, T.S.R., 
Kong, X.-P., Gary, S., Burgers, P.M. & Kuriyan, J. (1994) "Crystal structure of the 
eukaryotic DNA polymerase processivity factor PCNA", Cell 79, 1233-1243; Gulbis, 
J.M., Kelman, Z., Hurwitz, J., O'Donnel, M. & Kuriyan, J. (1996) "Structure of the 
C-terminal region of p21 WAFl/CIPl complexed with human PCNA", Cell 87, 297- 
306), 

- that of the 6 subunit of polymerase III of the eubacterium Escherichia coli), (Kong, 
X.-P., Onrast, R., O'Dqnnell, M. & Kuriyan, J. (1992) "Three dimensional structure 
of the 6 subunit of Escherichia coli DNA polymerase III holoenzyme; a sliding DNA 
clamp". Cell 69, 425-437) 

and that of the bacteriophages T4 Gen45 proteins (Kelman, Zvi, Hurwitz, J. 
O'Donnel, Mike (1998) Structure 6, 121-125). 

Such sliding clamps are very similar in their overall structure: Superimposed, the images 
of the overall protein structure of PCNA, the B subunit, and of the gp45 rings are congruent 
(Kelman, Z. & O'Donnell, M. (1995) "Structural and functional similarities of prokaryotic and 
eukaryotic sliding clamps", Nucleid Acids Res. 23, 3613-3620). Each ring is comparably 
dimensioned with a central opening large enough to encircle duplex-DNA, that is, a double 
strand of DNA composed of two complementary DNA strands. 

In vivo, the sliding clamp itself is unable to position itself around the DNA; rather it has 
to be fixed around the DNA by means of a clamp loader. Such a clamp loader is a protein 
complex which in prokaryotes and eukaryotes consists of a plurality of subunits, and which is 
called y-complex in conjunction with the eubacterium Escherichia coli, and replication factor C 
(RF-C) in conjunction with hximans (Kelman, Z. & O'Donnell, M. (1994) "DNA replication - 
enzymology and mechanisms", Curr. Opin. Gent. Dev. 4, 185-195). The sliding clamp 
recognizes the 3 '-end of the single strand duplex (primer-template) and positions the sliding 
clamp in the presence of ATP around the DNA. The sliding clamp which encircles the DNA 
interacts with polymerases, thereby ensuring rapid and processive DNA synthesis. 

In the case of bacteriophage T7, the same goal, i.e. processive DNA synthesis, is 
achieved by means of a structurally different protein complex. The phage expresses its own 
catalytic polymerase, the T7 polymerase, the product of gene 5, which enters into a bond with a 
protein derived from the host Escherichia coli, thioredoxin and, as a replicase, enables highly 
processive DNA repHcation (Proc Natl Acad Sci USA October 15, 1992; 89(2):9774-9778 
"Genetic analysis of the interaction between bacteriophage T7 DNA polymerase and Escherichia 
coli thioredoxin", Himawan J.S., Richardson C.C.). This also leads to clamp formation, although 
the structure of said clamp is not the same as, for example, in the case of eukaryotic PCNA. 

It is often necessary ~ as for example in the case of human polymerase 8 ~, for proteins 
(coupling proteins) to create a link between the catalytically active portion of the polymerase and 
the processivity factor (sliding clamp). In humans, this is the small subxmit of the 5 polymerase 
(Zhang, S.-J., Zeng, X.-R., Zhang, P., Toomey, N.L., Chuang, R.Y., Chang, L.-S., and Lee, 
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M.Y.W.T. (1994), "A conserved region in the amino terminus of DNA polymerase 5 is involved 
in proliferating cell nuclear antigen binding", J. BioL Chem. 270, 7988-7992. In the case of T7 
polymerase, however, the processivity factor directly binds the catalytically active xmit of the 
polymerase. 

DNA polymerases are characterized by two properties, among others, their rates of 
elongation, that is, the number of nucleotides they are able to incorporate per second into a 
growing strand of DNA, and their dissociation constants. If, after each incorporation step, a 
polymerase again dissociates from the strand one of the nucleotides in the growing chain (that is, 
one elongation step occurs per bonding event), processivity then has a value of 1 and the 
polymerase is non-processive. If the polymerase remains bonded to the strand during repeated 
nucleic acid incorporation, then the replication modus is termed processive and may achieve a 
value of several thousand (see also: Methods in Enzymology, Volume 262, DNA Replication, 
Edited by J.L. Campbell, Academic Press 1995, pp. 270-280). 

For most in vitro applications, such as PGR or sequencing processes, processivity is a 
desirable characteristic, but one which to date is possessed by thermostable enzymes used in 
such reactions to only a small degree. In contrast, the temperature sensitive T7 polymerase 
associated with thioredoxin exhibits a processivity of several thousand nucleotides. By 
comparison, the Thermus thermophilus or aquaticus-derived thermostable DNA polmerases have 
a processivity of approximately 50 nucleotides (Biochim Biophys Acta Nov. 7, 1995; 
1264(2):243-248, "Inactivation of the 5'-3' exonuclease of Thermus aquaticus DNA 
polymerase", Markens L.S., Bryan, S.K., Mokses R.E.). 

U.S. Patents 4,683,195, 4,800,195 and 4,683,202 describe the application of such 
thermostable DNA polymerases in polymerase chain reactions (PGR). In a PGR, a DNA is 
newly synthesized using primers, templates, nucleotides a DNA polymerase of a corresponding 
buffer and under suitable reaction conditions. During the process the double-stranded target 
sequence is most conmionly fused, two oligonucleotides are hybridized thereto and the 
complementary sequence is then synthesized by the polymerase on the template through 
incorporation of nucleotides. The extension product of each primer functions as a template for 
the next cycle. In PGR of this type, it is preferable to use a thermostable polymerase which 
survives the cyclical, thermal fusion of the DNA strands. Thus, Taq DNA polymerase is 
frequently used (U.S. Patent 4,965,188). However, as described above, the processivity of Taq 
DNA is relatively small as compared to T7 polymerase. 

DNA polymerases are also used in DNA sequence determination (Sanger et al., Proc. 
Natl. Acad. Sci., USA 74:5463-5467 (1997). Per Sanger, in sequence determination a T7 DNA 
polymerase is frequently used (Tabor, S. and Richardson, G.G., Proc. Natl. Acad. Sci., USA 
86::4076-4080( 1989)). Subsequently, the cycle-sequencing method was developed (Murray, V. 
(1989) Nucleic Acids Res. 17, 8889), which does not require a single stranded template and 
which permits initiation of the sequence reaction with relatively small amoxints of template. 
Polymerases utilized in this method may include, for example, the aforementioned Taq 
polymerase (U.S. Patent 5,075,216) or Thermotoga neapolitana-derived polymerase 
(W096/ 10640), or other thermostable polymerases. Newer methods combine the exponential 
amplification and sequencing of a DNA fragment in one step, allowing for direct sequencing of 
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genomic DNA. One such method, the so-called DEXAS-method (Nucleic Acid Res., May 15, 
1997; 25(10): 2032-2034, "Direct DNA sequence detemiination from total genomic DNA", 
Kilger C, Paabo, S., Biol Chem, Feb. 1997; 378(2):99-105, "Direct exponential amplification 
and sequencing DEXAS) of genomic DNA", Kilger, C, Paabo, S, and DE 196 53 439.9, as well 
as DE 196 53 494.1) employs a polymerase with reduced discrimination capacity vis-a-vis 
dideoxynucleotides (ddNTPs) as compared with deoxynucleotides (dNTPs), further a reaction 
buffer, two primers which are preferably not equimolar, and the aforemtioned nucleotides, 
whereby in multiple cycles a complete, sequence-specific DNA ladder of a fragment is then 
obtained which is encompassed by the primers. In a further modification of this method a 
polymerase mix is used, in which one of the two polymerases discriminates between ddNTPs 
and dNTPS, while the other displays a reduced discrimination capacity (Nucleic Acids Res., May 
15, 1997; 25(10):2032-2034, "Direct DNA sequence determination from total genomic DNA", 
Kilger C, Paabo, S). 

DNA polymerases are also used in reverse transcription of RNA into DNA. Here, RNA 
serves as a template and the polymerase synthesizes a complementary DNA strand, using, for 
example, the thermostable DNA polymerase derived from the Thermus thermusphilus (Tth) 
organism (U.S. Patent 5,322,770). 

It may also be desirable for the polymerase to exhibit "proof-reading" activity, that is a 
3 '-5' exonuclease activity. This characteristic is particularly desirable when the product being 
synthesized is to be generated at a low error rate during nucleotide incorporation. 

The aforementioned enzymes that are commonly used in PCR-reactions, are not for the 
most part members of the group of actual replication enzymes, but rather are mostly enzymes 
that are assumed to participate in DNA repair, which accounts for their low degree of 
processivity. 

It is thus an object of the present invention to combine several of the aforementioned 
characteristics of polymerases, in particular high processivity and thermostability, for use in in 
vitro reactions. 

This objective has been achieved in accordance with the present invention by the 
preparation of a thermostable in vitro-complex for template-dependent elongation of nucleic 
acids, comprising a thermostable sliding clamp protein that is connected to an elongation protein 
exhibiting thermostable polymerase activity. The complex may be utilized in in- vitro reactions, 
such as PCR-reactions in which it exhibits high processivity. It is also advantageous if the 
complex exhibits a low error rate during nucleotide incorporation, that is, it possesses enhanced 
fidelity. Hence, it is possible to use the complex in conjunction with elongation, amplification 
and sequencing of nucleic acids. The complex may preferably be utilized in Standard-PCR- 
reactions. 

To ensure the suitability of such a complex in Standard-PCR-reactions, it must be easy to 
manipulate. Thus, the reaction apparatus of the simian virus 40 has been constituted in vitro 
(Wage, S. & Stilhnan, B. (1994) Nature, vol. 369:207-221), but such a reaction apparatus is 
unsuitable for Standard-PCR reactions because among other things, it is not thermostable. 
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Hence, the object of the present invention is the preparation of a thermostable in vitro-complex 
exhibiting polymerase activity that is useable in Standard-PCR reactions and which displays high 
processivity. 

hi particular, the object of the present invention is a thermostable prokaryotic, in vitro 
complex for elongation of nucleic acids, which comprises a thermostable sliding clamp protein 
that encircles entirely or in part the complementary nucleic acid strand, and a thermostable 
protein exhibiting polymerase activity, wherein said protein or protein complex is coupled with 
the sliding clamp protein. 

Within the scope of the present invention the concept of an elongation protein exhibiting 
polymerase activity also encompasses protein complexes that exhibit polymerase activity, or 
subunits of such complexes which carry the polymerase activity. 

"Thermostable" as used in the present invention means that the accessory complex with 
high processivity incorporates nucleotides in growing nucleic acid chains, under both low and 
high temperatures that occur in PGR or other reactions, such as for example DNA sequencing. 

As a rule, PGR is consists, for example, of the steps of denaturing (70° to 98'C), 
annealing (40° to 78'G) and DNA strand synthesis (60° to 76°G). Therefore, the complex must 
be functional at least between about 60° and about 70°G, in particular between 60° to 76"G, and 
especially preferably between 40° to 98°C. No irreversible denaturing of the complex or 
individual components may occur over the course of the entire reaction that will stop or inhibit 
the elongation reaction. 

The sliding clamp protein and the elongation protein exhibiting polymerase activity may 
be coupled to one another by covalent or non-covalent bonding, hi a preferred embodiment the 
sliding clamp protein and the elongation protein are connected by means of a coupling protein. 

In another preferred embodiment of the present invention the associated proteins within 
the inventive thermostabile complex derive from archaebateria, though within the scope of the 
present invention it is also feasible that the associated proteins may derive from eubacteria. An 
object of the present invention is further a thermostable in vitro complex according to the present 
invention, in which the associated proteins are derived in part from archaebacteria and in part 
from eubacteria. 

Within the meaning of the present invention, the notion prokaryotic protein encompasses 
both proteins derived from archaebacteria and proteins derived from eubacteria. The replication 
apparatus in archaea is known to be similar to the eukaryotic replication apparatus, even though 
the genomic organization in eukaryotes and archae are complete different, and the cellular 
structure of eubacteria resembles that of archaea (Edgell, D.R. and Doolittle, W.F. (1997), 
"Archaea and the origin(s) of DNA replication proteins", Gell 89:995-998. 

Thus, a preferred embodiment of the present invention is a thermostable prokaryotic in 
vitro complex in which a protein complex exhibiting polymerase activity is present which 
consists of a coupling protein and one or more elongation proteins exhibiting polymerase 
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activity. Thus, a thermostable prokaryotic in vitro complex according to the present invention is 
preferred in which the sliding clamp protein has a ring-shaped structure that encircles entirely or 
in part the complementary nucleic acid strands. 

The Sliding Clamp Protein 

The following explanations are intended to assist in better understanding the function and 
potential manifestations of the sliding clamp protein. 

The sliding clamp protein serves the function of binding the polymerase activity to the 
DNA. Either the sliding clamp protein itself encircles the DNA completely or in part, or a clamp 
is formed by association to the protein exhibiting polymerase activity or to a protein complex 
exhibiting polymerase activity, or to a subunit thereof In any case, the level of processivity is 
significantly increased, at least by a factor of 1 Vi as a result of the clamp formation. 

That means that the in vitro complex according to the present invention exhibits at least 1 
/4 times as much processivity as compared to the elongation protein alone, or as compared to a 
protein complex exhibiting polymerase activity without a sliding clamp, or a subunit thereof 

Functioning slide clamps may be, for example, human genome-derived homologs of the 
"Proliferating Cell Nuclear Antigen"- protein complex, or homologs also of the ring-shaped, E- 
coli-derived "p-clamp"- protein complex, which are derived from thermostable organisms and 
thus are thermostable, or from non-thermostable organisms, which are subsequently rendered 
theremostable by altering the amino acid sequence (Eijsink V.G., van der Zee J.R., van den Burg 
B., Vriend G. Venema, G. FEBS Lett, April 22, 1991, 282(1):13-16, Improving the 
thermostability of the neutral protease of Bacillus stearothermophilus by replacing a buried 
asparagine by leucine", Bertus Van den Burg, Gert Vriend, Oene R. Veltman, Geard Venema, 
and Vincent G. H. Eijsink, "Engineering an enzyme to resist boiling", PNAS 1998, 95:2056- 
2060). The sliding clamp may be constructed of multiple components. The sliding clamp 
identified in the human genome consists of three PCNA-protein components (SEQ ID NO: 1 1) 
(homotrimers), the sliding clamp identified in the E. coli genome consists of two components 
(SEQ ID NO: 35) (homodimers). 

Within the meaning of the present invention a slide clamp is understood to be in 
particular any protein that possesses the fiinctional property of polymerase activity enhancement 
or serves to lower the error rate. In addition, the sliding clamp may exhibit a ring-shaped, three- 
dimensional structure, or form a ring-shaped, three-dimensional structure by coupling to another 
protein, which enables it to encircle, either entirely or in part, single and double-stranded DNA. 

A sliding clamp within the meaning of the present invention is understood to be a protein which 

1. exhibits in a sequence alignment to the human (eukaryotes) PCNA amino acid 
sequence (SEQ ID NO: 1 1) having a length of at least 100 amino acids at least a 20% sequence 
identity, or 
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2. exhibits in a sequence alignment to the bacterial p-clamp sequence derived from E. 
coli (eubacteria) (SEQ ID NO: 35) having a length of at least 100 amino acids at least a 20% 
sequence identity, or 

3. exhibits in a sequence alignment to the amino acid sequence of the PCNA homologs 
derived from Archaeoglobus Fulgidus (Archaea) (SEQ ID NO: 12) having a length of at least 
100 amino acids at least a 20% sequence identity. 

The sliding clamp protein according to the present invention may have one or more of the 
aforementioned features. 

The sequence identities cited in Fig 1 were determined with the help of the BLAST 
Algorithm according to Altschul, S.F., Gish, W., Miller, W., Myers, E.W., and Lipman, D.J., J. 
Mol. Biol. 215:403-410 (1990) and are discussed in detail below. 

Sliding clamps within the meaning of the present invention are also understood to be 
proteins that contain one or both of the following consensus sequences and which deviate from 
these sequences at no more than four positions (Abb.4): 

Region 1 

(SEQ ID NO: 39) 

[G A V L I M P F W]-D-X-X-X-[G A V L I M P F W]-X-X-[G A V L I M P F W]-X-[G A V L I 
M P F W]-X-[G A V L I M P F W]-X-X-X-X-F-X-X-Y-X-X-D and/or 

Region 2 

(SEQ ID NO: 40) 

[G A V L I M P F W]-X(3)-L-A-P-[K R H D E]-[G A V L I M P F W]-E. 

The amino acids are designed here using the standard lUPAC single letter nomenclature 
and cited in accordance with the Prosite Pattem description standard, and where the following 
amino acid groups are frequently Ixmiped together: 

G, A, V, L, I, M, P, F or W (amino acids with non-polar side chains) 
S, T, N, Q, Y, or C (amino acids with non-charged polar side chains) 
K, R, H, D or E (amino acids with charged and polar side chains) 

In addition, X stands for any arbitrary amino acid, insertion or deletion in the sequence listing. 

Also generated from the multiple alignment of human PCNA homologs shown in Fig. 12 
was a Hidden Markov Model. Thus, a sliding clamp within the meaning of the present invention 
is understood to be in particular any protein that with a Hidden Markov Model so generated has a 
score of greater than 20 (Fig, 12). The Hidden Markov Models and corresponding scores were 
calculated using the hmmfs program (Version 1 .8.4, July 1997) from the HMMER packet 
(HMMER Protein and DNA Hidden Markov Models (Version 1.8) by Sean Eddy, Dept. of 
Genetics, Washington University School of Medicine, St. Louis, USA). 
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A Hidden Markov Model was generated from the multiple alignment of E. coli p-clamp 
homologs shown in Fig. 13. Thus, a sliding clamp within the meaning of the present invention is 
xmderstood to be in particular any protein that with a Hidden Markov Model so generated has a 
score of greater than 25 (Fig. 13). 

The sliding clamp may be constructed of multiple components that are fixedly boimd to 
one another by a characteristic bond to thereby form a stable ring-shaped molecular complex 
which cannot readily dissociate from the DNA. This makes possible a fixed but non-covalent 
bond to the DNA, but which does not inhibit free displaceability along the latter. In addition, the 
processivity-enhancing sliding clamp proteins possess characteristic local molecular properties in 
the region of interaction with the DNA, which improve free displaceability, and which may then 
be supported via water molecules stored in this region. 

A preferred embodiment of the present invention is fiirther in particular a thermostable 
prokaryotic in vitro complex, wherein the sliding clamp protein is one of the following: 
Archaeoglobus Fulgidus- derived AF0335, Methanococcus Jannashcii-derived MJ0247, 
Pyrococcus Horikoschii-derived PHLA008, Methanobacterium Thermoautotrophicus-derived 
MTH1312, as well as Aquifex Aeolicus-derived AE000761_7. 

In particular, the subject matter of the present invention encompasses thermostable 
prokaryotic in vitro complexes, in which the sliding clamp protein has an amino acid in 
accordance with SEQ ID NO: 11, 12, 13, 14, 15 and 36 (Aquifex Aeolicuc). 

Sliding Clamp Loader 

In addition, a preferred embodiment is understood to be one in which part of the complex 
according to the present invention is a sliding clamp loader, which assembles the components of 
the sliding clamp around the continuous DNA strand, or removes them when the reaction is 
completed. Said sliding clamp loader is preferably associated with the in vitro complex of the 
present invention. 

In humans the sliding clamp loader consists of five subunits, 4 small (sliding clamp 
loader 1) and one large subunit (sliding clamp loader 2). Functional sliding clamp loaders in 
accordance with the present invention are, for example, one or more prokaryotic homologs of 
"Replication "-Factor C-protein complex identified in humans (sliding clamp loader 1 :SEQ ID 
NO: 1, 32, 33, 34 and sliding clamp loader 2 SEQ ID NO: 6). 

A sliding clamp loader within the meaning of the present invention is understood to be in 
particular a protein that exhibits in a sequence alignment to the human (eukaryotic) amino acid 
sequence (SEQ ID NO: 1, 23, 33, 34 ) having a length of at least 100 amino acids at least a 20% 
sequence identity. 

A sliding clamp loader 2 within the meaning of the present invention is also understood 
to be in particular a protein that exhibits in a sequence alignment to the hxmian (eukaryotic) 
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amino acid sequence (SEQ ID NO: 6) having a length of at least 1 50 amino acids at least a 20% 
sequence identity. 

Suitable clamp loaders are, for example, the Arachaebacteria-derived homologs cited in 
Fig. 1, (SEQ ID NO: 3, 4, 5; homologs to sliding clamp loader 1 and SEQ ID NO: 8, 9, 10, 
homologs to sliding clamp loader 2). Therefore, a thermostable prokaryotic in vitro complex is 
especially preferred, in which an RFC-homolog protein or protein complex is also present. 

Thus, for example, a sliding clamp loader 1 within the meaning of the present invention 
is also understood to be any protein that contains the following consensus sequences and which 
deviates from these sequences at no more than four positions (Abb.6): 

SEQ ID NO: 41 

C-N-Y-X-S-[K R H D E ]-I-I-X-[G A V L I M P F W]-[G A V L I M P F W]-Q-S-R-C-X-X-F- 
R-F-X-P-[G A V L I M P F W]. 

Thus, for example, a sliding clamp loader 2 within the meaning of the present invention 
is also understood to be any protein that contains the following consensus sequences and which 
deviates from these sequences at no more than four positions (Abb. 7): 

SEQ ID NO: 42 

K-X-X-L-L-X-G-P-P-G-X-G-K-T-[S T N Q Y C]-X-[G A V L I M P F W]-X-X-[G A V L I M 
P-F W] 

Additionally, a Hidden Markov Model was generated from the multiple alignment of 
hximan sliding clamp loader 1 homologs shown in Fig. 14. Thus, a sliding clamp loader 1 within 
the meaning of the present invention is understood to be in particular any protein that with a 
Hidden Markov Model so generated has a score of greater than 25 (Fig. 14). 

Additionally, a Hidden Markov Model was generated from the multiple alignment of 
human sliding clamp loader 2 homologs shown in Fig. 15. Thus, a sliding clamp loader 2 within 
the meaning of the present invention is understood to be in particular any protein that with a 
Hidden Markov Model so generated has a score of greater than 1 5 (Fig. 1 5). 

It is also preferred a sliding clamp loader is present in the form of a protein that is 
homologous with the Eubacteriimi Eschereichia coli y-complex. 

Also preferred in accordance with the present invention is a thermostable in vitro 
complex for the elongation of nucleic acids, in which apart from the sliding clamp loader, ATP 
may also be present, preferably also associated therewith. 

Coupling Protein 

The fimction of the coupling protein is to connect the elongation protein with the sliding 
clamp protein. A coupling protein within the meaning of the present invention is xmderstood to 
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be in particular any protein having the above described function. Suitable coupling proteins are, 
for example, the Archaebacteria-derived homologs to the human sequence of the coupling 
subunit (DPD2-HUMAN, [SEQ ID NO: 16] cited in Fig. 1 (SEQ ID NO: 17, 18, 19, 20, 21). 

A coupling protein within the meaning of the present invention is understood to be in 
particular a protein that exhibits in a sequence alignment to the human (eukaryotic) amino acid 
sequence (SEQ ID NO: 16) having a length of at least 1 50 amino acids at least a 18% sequence 
identity. 

Thus, a coupling protein within the meaning of the present invention is understood to be 
any protein that contains the following consensus sequence and deviates from this sequence at no 
more than four positions (Fig. 5); 

SEQ ID NO: 43 

[FL]-G A V L I M P F W]-X-X-[G A V L I M P F W]-X-G-X(13)-[G A V L I M P F W]-X- 
[YR]-[G A V L I M P F W]-X-[G A V L I M P F W]-A-G-[DN]-[G A V L I M P F W]-[G A V 
L I M P F W]-[DS]. 

A Hidden Markov Model was also generated from the multiple alignment of human 
coupling subunit homologs shown in Fig. 16. Thus, a coupling subunit within the meaning of 
the present invention is understood to be any protein that with a Hidden Markov Model so 
generated has a score of greater than 10 (Fig. 16). 

Elongation protein exhibiting polymerase activity, or enzyme complex exhibiting polymerase 

activity or a subimit thereof 

Some elongation proteins require the presence of a coupling protein (coupling subunit) in 
order to exhibit polymerase activity at all. However, it is conceivable that the elongation protein 
may bind directly to the sliding clamp. Other elongation proteins require the presence of a 
coupling protein for bonding with the slide clamp. 

The elongation protein exhibits 5 '-3' polymerase activity or reverse-transcriptase activity. 
Suitable proteins in such case are, for example, Archaebacteria-derived homologs of the human 
elongation protein (SEQ ID NO: 22) cited in Fig. 1 (SEQ ID NO: 23, 24, 25, 26). 

Preferably, the elongation protein binds to the coupling protein, which in tum is coupled 
to the sliding clamp protein. 

An elongation protein within the meaning of the present invention is understood to be in 
particular a protein that exhibits in a sequence alignment to the hxunan (exikaryotic) amino acid 
sequence (SEQ ID NO: 22) having a length of at least 200 amino acids at least a 20% sequence 
identity. 
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Thus, an elongation protein within the meaning of the present invention is understood to 
be in particular any protein that contains the following consensus sequence and deviates from 
this sequence at no more than four positions (Fig. 8): 

SEQ ID NO: 44 

D-[G A V L I M P F W]-[G A V L I M P F W]-X-X-Y-N-X-X.X-F.D-X-P-Y-[G A V L I M P F 
W]-X-X-R-A. 

A Hidden Markov Model was also generated from the multiple alignment of human 
elongation protein homologs (SEQ ID NO: 22) shown in Fig. 17. Thus, an elongation protein 
within the meaning of the present invention is understood to be any protein that with a Hidden 
Markov Model so generated has a score of greater than 20 (Fig. 17). 

An elongation protein within the meaning of the present invention is understood to be in 
particular a protein that exhibits in a sequence alignment to the archaeabacterial amino acid 
sequence (SEQ ID NO: 27) having a length of at least 400 amino acids at least a 25% sequence 
identity. For example, suitable proteins are those derived from Archaeabacteria with the SEQ ID 
NO: 28, 29, 30 or 31. 

Thus, an elongation protein within the meaning of the present invention is understood to 
be in particular any protein that contains the following consensus sequence and deviates from 
this sequence at no more than four positions (Fig, 9): 

SEQ ID NO: 45 

A-[G A V L I M P F W]-R-T-A[G A V L I M P F W]-A-[G A V L I M P F W]-[G A V L I M P 
F W]-T.E.G-[G A V L I M P F W]-V-X-A-P-[G A V L I M P F W]-E.G-I-A-X-V-[K R H D E]- 
1. 



A Hidden Markov Model was also generated from the multiple alignment of human 
elongation protein homologs (SEQ ID NO: 27) shown in Fig. 18. Thus, an elongation protein 
within the meaning of the present invention is mderstood to be any protein that with a Hidden 
Markov Model so generated has a score of greater than 35 (Fig. 18). 

The elongation protein may also be eubacterial in origin. 

Thus, an elongation protein within the meaning of the present invention is also 
understood to be a protein that exhibits in a sequence alignment to the eubacterial amino acid 
sequence (SEQ ID NO: 37) having a length of at least 300 amino acids at least a 25% sequence 
identity. 

Thus, an elongation protein within the meaning of the present invention is also 
understood to be in particular any protein that contains the following consensus sequence and 
deviates from this sequence at no more than four positions (Fig. 10): 
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SEQ ID NO: 46 

[G A V L I M P F W]-P-V-G-[G A V L I M P F W]-G-R-G-S-X-[G A V L I M P F W]-G-S-[G 
A V L I M P F W]-V-A-X-A-[G A V L I M P F W]-X-I-T-D4G A V L I M P F W]-D-P-[G A V 
L I M P F W]-X-X-X-[G A V L I M P F W]-L-F-E-R-F-L-N-P-E-R-[G A V L I M P F W]-S-- 
M-P-D. 

A Hidden Markov Model was also generated from the multiple alignment of eubacterial 
elongation protein homologs (SEQ ID NO: 37) shown in Fig. 19. Thus, an elongation protein 
within the meaning of the present invention is understood to be any protein that with a Hidden 
Markov Model so generated has a score of greater than 20 (Fig. 19). 

Heretofore, some DNA polymerases have been used as elongation proteins without 
coupling proteins and without sliding clamps in Standard PGR reactions, such as, for example, 
DNA polymerase I derived from Pyrococcus furiosus (U.S. Patent No. 5,545,552) or from 
Pyrococcus species (EP Patent Application No. 0 547 359 Al). Such enzymes have the 
distinctive property of being thermostable and frequently of possessing 3 '-5' exonuclease 
activity ('proof-reading activity'). Only recently was a heterodimer having polymerase activity 
discovered in Pyrococcus fiiriosus (Uemori, T., Sato, Y., Kato, L, Doi, H., and Ishino, y. (1997). 
"A novel DNA polymerase in the hyperthermophilic archaeon, pyrococcus ftiriosus: gene 
cloning, expression, and characterization", Genes to Cells 2:499-512.) 

Within the scope of the present invention, it is preferable that the inventive prokaryotic in 
vitro complex for elongating nucleic acids consist of proteins derived from Archaea. That is, it is 
preferable to utilize a sliding clamp protein derived from Archaeabacteria. Further, it is 
preferable for the elongation protein or a protein complex exhibiting polymerase activity 
consisting of an elongation protein and coupling proteins, to be derived from Archaeabacteria. 

Examplars of proteins suitable for the complex according to the present invention are 
shown in Figs. 1 and 2. 

Of course, it is possible to optimize the properties of said proteins by deletions or 
mutations, or by addition of amino acids. Such altered proteins as well form the subject matter 
of the present invention, to the extent that they form the in vitro complex for elongating nucleic 
acids in accordance with the present invention, in addition to fulfilling the fimctions described in 
greater detail above. 

The complex according to the present invention is illustrated in Fig. 3, which shows by 
way of example the replication apparatus in a potential variant, wherein the sliding clamp is 
bonded by means of a coupling subunit to the elongation protein. 

Further, it is preferable that in addition to the prokaryotic accessory in vitro complex 
according to the present invention, two primers are also present. As a rule, primers are 
oligonucleotides that bond to both complementary DNA strands of a target sequence, in which 
disposed in opposite orientation with their 3' ends directed towards one another, they enclose the 
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segment being amplified. They serve as a starting point for amplification and generally provide 
the polymerase with a fi-ee 3 '-OH end for incorporating a nucleotide. 

As the protein complex is used for amplification, elongation and sequencing, the complex 
according to the present invention is present preferably in a suitable buffer. Suitable buffers are 
those that can be utilized for PGR, sequencing, nucleic acid labeling and for other in vitro nucleic 
acid elongation reactions by means of polymerases. Suitable buffers are described, for example, 
in Methods in Molecular Biology, Vol. 1 5, Humana Press Totowa, New Jersey, 1993, edited by 
Bruce A. White. 

As the protein complex is used for elongation, amplification, reverse transcription and/or 
sequencing, there is present, in addition to the complex according to the present invention, a 
mixture of nucleotides. Deoxynucleotides may be selected fi-om dGTP, dATP, dTTP and dCTP, 
but are not limited to these. In addition, derivatives of deoxynucleotides are also feasible 
according to the present invention, the former being defined as those deoxynucleotides capable 
of being incorporated by way of a thermostable polymerase in growing DNA molecules that are 
being synthesized in the reaction. Such derivatives include, but are not limited to, 
thionucleotide, 7-deaza-2'-dGTP, 7-deaza-2'-dATP as well as deoxyinosine triphosphate, which 
may also be used as substitute deoxynucleotide for dATP, dOTP, dTTP or dCTP. It is also 
feasible to use labeled deoxynucleotides. All known forms of labeling and/or those suitable for 
the purpose of the present invention may be present. 

Dideoxynucleoties may be selected from ddOTP, ddATP, ddTTP and ddCTP, but are not 
limited to these. In addition, derivatives of dideoxynucleotides are also feasible according to the 
present invention, the former being defined as those dideoxynucleotides capable of being 
incorporated by way of a thermostable polymerase in growing DNA molecules that are being 
synthesized in the reaction. Such derivatives include, but are not limited to, radioactive 
dideoxynucleotides (ddATP, ddOTP, ddTTP and ddCTP) or dideoxynucleotides (ddATP, 
ddOTP, ddTTP and ddCTP), that are labeled, amongst others, with, e.g. FITC, Cy5, Cy5.5, Cy7 
and Texas red. Within the scope of a sequencing according to the present invention it is also 
feasible to use labeled deoxynucleotides together with imlabeled dideoxynucleotides. 

Ribonucleotides may be selected from OTP, ATP, TTP and CTP, but are not limited to 
these. In addition, derivatives of ribonucleotides are also feasible according to the present 
invention, the former being defined as those ribonucleotides capable of being incorporated by 
way of a thermostable polymerase in growing DNA molecules that are being synthesized in the 
reaction. Such derivatives include, but are not limited to, radioactive ribonucleotides (ATP, 
OTP, TTP and CTP) or ribonucleotides (ATP, OTP, TTP and CTP), that are labeled, amongst 
others, with, e.g. FITC, Cy5, Cy5.5, Cy7 and Texas red. 

When using the protein complex for amplification, elongation and sequencing, it may 
prove advantageous if a pyrophosphatase is present during the reaction. 

Fxirther subject matter of the present invention is a thermostable, accessory in vitro 
complex comprising both a sliding clamp protein and a coupling protein, wherein said proteins 
are defined as above. 
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The accessory in vitro complex according to the present invention is to a certain extent to 
be understood as a precursor to the above described inventive thermostable in vitro complex for 
template-dependent elongation of nucleic acids. 

The accessory complex need only be combined with a single elongation protein in order 
to confer on the latter significantly enhanced processivity. Thus, it is feasible to use the 
accessory in vitro complex in combination with known thermostable polymerases, wherein the 
disadvantages of such known proteins as well, in particular with regard to processivity, are 
reduced. 

Gene identification, gene cloning, gene expression and purification of proteins of the in 
vitro complex according to the present invention: 

Generally, complexes consisting of recombinant proteins may be prepared according to 
following steps: Preparation of the nucleic acid fi'agment that encodes for the desired protein, 
ligation in an expression vector, transformation into a host, expression and purification of the 
protein. In accordance with the present invention, it may happen that genes, in particular 
Archaebacterial derived, may contain inteins that must first be removed (Proc Natl Acad Sci 
USA, June 15 1992; 89(12):5577-5581, "Intervening sequences in an Archaea DNA polymerase 
gene", Perler, F.B., Comb, D.G., Jack, W.E., Moran, L.S., Qiang, B., Kucera, R.B., Benner, J., 
Slatko, B.E., Nwankwo, D.O., Hempstead, S.K., et al.) 

Identification of additional proteins suitable for the complex of the present invention may 
be accomplished, e.g. through homology searching in databases that encompass prokaryotic 
genomes. Several programs are suitable for such purpose, such as for example, the program 
BLASTP and FASTA (Altchul, Stephen F., Thomas L. Madden, Alejandro A., Schaffer, Jinghui 
Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997, "Gapped BLAST and PSI- 
BLAST: a new generation of protein database search programs". Nucleic Acids Res. 25:3389- 
3402. W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448). 

Identification may also be accomplished using DNA probes, for the purpose of screening 
for suitable genes in e.g. total-genomic prokaryotic databases. The experimental methods 
required for this are found in Maniatis et al. (Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory Press, N.Y. (1989). 

The purified nucleic acid of the genes of the inventive complex may be prepared, e.g. by 
isolating them fi"om a genomic database of the relative organism or by means of synthetically 
manufactured DNA, in each case optimally in combination with PCR amplification with the help 
of primers specific to the desired gene segment. Conventional methods are described in Maniatis 
et al. (Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, N.Y. 
(1989). 

The genes of the proteins of the inventive in vitro complex can be cloned in accordance 
with a nimiber of techniques, and thus be made available using an expression vector for protein 
expression in a host organism. Current methods are described in Maniatis et al. (Molecular 
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Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, N.Y. (1989). Here, the 
genes of the complex may e.g. first be cloned in a "high-copy" vector, for example, pUC18, pBst 
or pBR322, and only then recloned in a prokaryotic expression vector, such as pTrc99, or 
alternatively directly cloned in prokaryotic expression vector. Here, vectors are understood to be 
nucleic acids capable of transporting a second nucleic acid molecule into or between different 
organisms, or between genetic backgrounds. As a rule, they have the capability of autonomic 
replication and/or expressing (expression vectors) the operatively boxmd nucleic acid molecule. 
"Operatively bound" means that the transported nucleic acid molecule is connected to the vector 
in such a way that transcription and translation thereof are controlled by expression control 
sequences of the vector and may be expressed in a host cell. Bacterial expression systems, their 
preferred use and a selection of vector systems are described, e.g. in "Gene Expression 
Technology", (Method. EnzymoL, Vol. 185, Goeddel, Ed., Academic Press, N.Y. (1990). 
Vectors suitable for the present invention should enable various expression levels of the proteins 
by possessing some or all of the following characteristics: (1) promoters, or transcription 
initiation sites, either directly at the start of the protein, or as fusion protein (2) operators that 
may be used to switch gene expression on or off, (3) ribosomal binding sites for improved 
translation, and (4) transcription or translation termination sites which lead to improved stability. 

Expression vectors compatible with eukaryotic cells, preferably with vertebrate cells, 
may also be used. Some known vectors are pSVL and pKSV-10 (Pharmacia), pBPv-l/pML2d 
(Intemational Biotechnologies, Inc.) and pTDTl (ATCC 31255). Retroviral expression vectors 
may also be used. 

Thus, the subject matter of the present invention also includes DNA-sequences which 
code for thermostable in vitro complexes or accessory in vitro complexes according to the 
present invention, as well as corresponding vectors, preferably expression vectors. 

The vectors according to the present invention contain at least the gene for the sliding 
clamp protein and preferably at least a gene for a coupling and/or elongation protein, as 
described above, respectively. 

It is preferable within the scope of the present invention if in addition to DNA sequences 
already contained therein, the vector also contains suitable restriction interfaces and, optionally, 
polylinkers for inserting additional DNA sequences. It is especially preferred if the spatial 
arrangement of already present DNA sequence and additional insertion sites result after 
expression in the formation of a fusion protein. 

Further, it is preferred if the vector of the present invention also contains promoters 
and/or operator regions, in which it is especially preferred if such promoters and/or operator 
regions are inducible or repressible; In this way, control of the expression in the host cells is 
simplified considerably, and said control may be efficiently designed. 

Promoter/operator regions of this type may occur repeatedly in an expression vector, 
which optionally enables independent expression of multiple DNA sequences while utilizing just 
one expression vector. 
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Within the scope of the present invention, an especially preferred vector contains DNA 
sequences that encode for all components of an in vitro complex or accessory in vitro complex in 
accordance with the present invention. 

The subject matter of the present invention further includes a host cell which contains one 
or more vectors according to the present invention, wherein proteins may be expressed in said 
host cell under suitable conditions. Suitable conditions include, for example, the presence of an 
inductor or a derepressor. 

Standard protocols exist for transformation, phage infection and cell culture in Maniatis 
et al. (Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, N.Y.). 
Of the multiplicity of available E. coli strains suitable for transformation, those preferred are 
JMlOl (ATCC No. 33876), XLl (Stratagene), RRI (ATTC No. 31343) and BL21 (Pharmacia). 
Protein expression may also ensue e.g. with the help of E. coli strain INVaF (Invitrogen). 

The transformants are cultivated in accordance with suitable growing conditions of the 
host strain. Thus, for example, most E. coli strains are cultivated in LB medium, at 30°C to 42**C 
up to a logarithmic or stationary growth phase. The proteins may be purified from a transformed 
culture, this being effected either from a cell pellet, after centrifiigation or from the culture 
solution as well. In the event the proteins are purified from a cell pellet, said cells are then re- 
suspended in a suitable buffer, then broken open by means of sonification, treatment with 
enzymes or freezing and thawing. For purification from a culture suspension, that is, alone or 
via a fusion protein, the supematant cells are separated off using known methods, such as 
centrifiigation. 

Separation and purification of proteins of the complexes according to the present 
invention, either from the supematant of the culture solution or from the cell extract are effected 
using known methods of separation or purification. Said methods are e.g. of the kind that 
involve solubility, such as salt precipitates, and solvent precipitates, methods which exploit 
differences of molecular weight, such as dialysis, ultra-filtration, gel filtration, and SDS 
polyacrylamide gel electrophoresis, methods which exploit differences in electrical charge, ion 
exchange chromatography, methods which exploit different hydrophobicities, such as reverse- 
phase HPLC (High Performance Liquid Chromatography), methods which exploit specific 
affinities, such as affinity chromatography, and methods which exploit differences in isoelectric 
point, such as isoelectric focusing. It is also contemplated that cell extracts may be made, either 
from the organism that carries the gene of the accessory complex, which achieves the object of 
the present invention, or from the recombinant host organism, for example, E. coli. Using these 
extractions, it would be possible under certain circumstances to avoid additional purification 
steps. 

The methods described above may be utilized in a number of combinations for preparing 
the proteins of the in vitro complex. 

The thermostable prokaryotic accessory in vitro complex of the present invention may be 
used for elongating nucleic acids, e.g. for polymerase chain reactions, DNA sequencing or 
labeling of nucleic acids, as well as other reactions that contain nucleic acid in vitro synthesis. A 
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further possibility is the application in reverse transcription, wherein either the inventive 
complex itself exhibits reverse transcriptase activity, or alternatively a suitable enzyme is added 
which exhibits reverse transcriptase activity. 

Thus, the subject matter of the present invention also includes a method for template- 
dependent elongation of nucleic acids, in which the nucleic acids when necessary are denatured, 
are provided with at least one primer xmder conditions of hybridization, said primer being 
complementary to a flanking region of a desired nucleic acid sequence of the template strand, 
and in which primer elongation occurs with the help of a polymerase in the presence of 
nucleotides, and wherein a thermostable in vitro complex of the present invention is used as a 
polymerase. 

Methods for template-dependent elongation of nucleic acids, in which elongation starts at 
a primer that has been hybridized to the template nucleic acid and which provides a free 3' -OH 
end for the elongation, are known by those skilled in the art. For amplification, a polymerase 
chain reaction in particular is performed. Here, one normally starts with a double-stranded DNA 
sequence, of which a specific target region is to be amplified. Two primers are used which are 
complementary to regions flanking the target sequence on a partial strand each of the DNA 
double strand. In order to hybridize the primers, however, the double-stranded DNA is first 
denatured, in particularly thermally fused. Once the primers have been hybridized, elongation 
occurs by means of the polymerase, the DNA is then denatured once again, and finally the newly 
formed DNA strands are separated from the template strands. Thereupon, both the original 
template strands and the new nucleic acid strands formed in step one are present as templates for 
a second elongation cycle. The former are in turn hybridized with primers and a second 
elongation occurs. This process is performed cyclically with thermal denaturing as an 
intermediate step. 

A thermostable in vitro complex according to the present invention is also used for 
purposes of reverse transcription of RNA to DNA, preferred in accordance with the invention, 
the elongation protein of which exhibits reverse transcriptase activity. Said reverse, transcriptase 
activity may be the sole polymerase activity exhibited by the elongation protein, but it may also 
be present in addition to 5'-3'-DNA-polymerase activity. 

Yet another preferred method according to the present invention involves the sequencing 
of nucleic acids, initiated by a primer that is complementary to a region adjacent the nucleic acid 
being sequenced, and wherein a template-dependent elongation or, in the case of RNA 
sequencing, reverse transcription is performed using deoxynucleotides and dideoxynucleotides in 
accordance with the Sanger method. Within the scope of said preferred embodiment, the 
derivatives described above are also considered suitable deoxynucleotides or dideoxynucleotides, 
respectively. For the method of elongation of nucleic acids of the present invention, it is 
especially preferred if the nucleic acids so formed are labeled. For such purpose, it is feasible to 
use labeled primers and/or labeled deoxynucleotides and/or labeled dideoxynucleotides and/or 
labeled ribonucleotides or corresponding derivatives, respectively, exemplars of which were 
previously described above. 
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Still further, the subject matter of the present invention also includes a method for 
labeling nucleic acids by means of which single breaks are incorporated in the phosphodiester 
bonds of the nucleic acid chain and a nucleotide is replaced at the break sites by a labeled 
nucleotide with the help of a polymerase, wherein the polymerase used is a thermostable in vitro 
complex according to the present invention. 

A method of this type, commonly called nick translation, allows for simple labeling of 
nucleic acids. All of the above described labeled ribonucleotides or deoxyribonucleotides or 
derivatives thereof are suitable for labeling, as long as the polymerase accepts them as substrate. 

Labeling according to the present invention may be accomplished, for example, within 
the scope of or subsequent to a PGR reaction, and for which the use according to the present 
invention of a thermostable in vitro complex is especially favorable. 

The subject matter of the present invention also includes a kit for elongation and/or 
amplification and/or reverse transcription and/or sequencing of nucleic acids, wherein the kit 
may be present in one or more containers, and contains 

a) a thermostable in vitro complex according to the present invention or 

b) a thermostable, accessory in vitro complex and, optionally, a separate elongation 
protein exhibiting polymerase activity and, optionally, primers, buffer substances, 
nucleotides, ATP, additional cofactors and/or pyrophosphate. 

In particular, the subject matter of the present invention includes a kit for the elongation, 
amplification, reverse transcription, labeling and sequencing of nucleic acids, and which 
additionally contains deoxynucleotides or derivatives thereof 

Thus, for the amplification of nucleic acids, a preferred reagent kit according to the 
present invention, in addition to substances a) or b) which exhibit 5'-3 '-polymerase activity, also 
contains deoxynucleotides and/or derivatives thereof Optionally, ribonucleotides or derivatives 
thereof may also be used, namely when a polymerase is used that also accepts ribonucleotides as 
substrate. 

Additional preferred subject matter of the present invention includes a kit for sequencing 
nucleic acids which, in addition to deoxynucleotides or derivatives thereof, also contains 
dideoxynucleotides or derivatives thereof for chain termination. 

Further, preferred subject matter of the present invention includes in particular a kit for 
reverse transcription of nucleic acids in which either the complex of the present invention itself 
exhibits reverse transcriptase activity, or additionally, a suitable enzyme exhibiting reverse 
transcriptase activity is present, in which deoxynucleotides or derivatives thereof are contained 
in the reaction mixture. 

In yet another preferred embodiment said kit contains primers and/or deoxynucleotides 
and/or dideoxynucleotides and/or ribonucleotides and/or respective labeled derivatives thereof 
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For the sequencing of nucleic acids in particular, it is necessary to insert a label. 
Exemplars of suitable labels were described above and are also considered as preferred 
embodiments within the scope of the reagent kit according to the present invention. 

It is also feasible within the scope of the present invention to utilize the reagent kit for 
labeling nucleic acids within the framework of a so-called nick translation. In such case, the 
reagent kit contains the components a) or b) and labeled nucleotides, in which buffer substances, 
ATP or other cofactors and/or pyrophosphate may also be present. As a rule, primers are 
generally not required within the scope of a nick-translation. 

Further, the subject matter of the present invention also includes use of a thermostable 
sliding clamp protein in in-vitro methods for elongation, amplification, labeling and sequencing 
or reverse transcription of nucleic acids. 

Said kit is especially preferred if the sliding clamp protein is present in the form of a 
thermostable homolog to the PCNA-protein (SEQ ID NO: 12, 13, 14, 15), Archaeaoglobus 
Fulgidus-derived PCNA homologs, or homolog of the p-clamp-protein (SEQ ID NO: 35, 36). A 
kit is especially preferred which also contains a coupling protein. 

Further, a kit is preferred which contains an additional protein, wherein the additional 
protein is present as a clamp loader, is a derivative of the group of prokaryotes and is 
thermostable. 

In particular a kit is preferred that contains a buffer as described above. It is also 
preferable if the kit according to the present invention also contains a pyrophosphatase, ATP 
and/or other cofactors. 

The present invention is described in connection with the figures in greater detail below: 

Figures 

Frequently, sequence names are used below, by which the protein and nucleic acid 
sequences are identified in the Genbank and the EMBL database. 

Abb. 1 

Protein sequences, aligned in pairs, and multiple alignments derived from Archaebacteria 
and from corresponding human genes of the replication apparatus. 

In which the annotation: ^ signifies % identity to the corresponding human gene, 
calculated from the paired alignment (see appendix) using BLASTP 2.0.4 [Feb-24-1998], and the 
annotation ^ signifies % identity to the corresponding Archaeoglobus Fulgidus-derived gene, 
calculated from the paired alignment (see appendix) using BLASTP 2.0.4 [Feb-24-1998], and the 
annotation signifies % identity to the corresponding human gene, calculated from the paired 
aligmnent (see appendix) using FASTA 3.1t02 [March, 1998], These procedures are described 
in greater detail in; Altschul, Stephen F., Thomas L. Madden, Alejandro A. SchafiFer, Jinghui 
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Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI- 
BLAST: a new generation of protein database search programs". Nucleic Acids REs. 25: 389- 
3402, and W.R. Pearson & D.J. Lipman, PNAS (1988) 85:2444-2448. The figure shows in the 
case of sliding clamp loader I, sliding clamp loader II, sliding clamp, coupling subunit and 
elongation protein I the sequence names from the databases as well as their SEQ ID NO, in 
which the values indicated in brackets represent in each case the percentage identity per number 
of amino acids. In the case of the elongation protein II the values refer to the percentage 
sequence identity with the Archaeaoglobus fulgidus sequence. 

Fig. 2 

Protein sequences, aligned in pairs, and multiple alignments derived from eubacteria of 
the replication apparatus, in which the annotation ^ signifies % identity to the corresponding E. 
coli-derived gene, calculated from the paired alignment (see appendix) using BLASTP 2.0.4 
[Feb-24-1998]. The procedvire is described in greater detail in: Altschul, Stephen F., Thomas L. 
Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. 
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs". Nucleic Acids REs. 25: 389-3402. 

Fig- 3 

Outline of a potential variant of the replication apparatus, in which the sliding clamp is 
bonded via a coupling subunit to the elongation protein. 

Fig. 4 

Fig. 4 shows the alignments of two conserved regions of the sliding clamp protein, and 
the consensus sequences derived therefrom. The following genes are shown: PCNA human 
(from SEQ ID NO:l 1), the corresponding sequence derived from Archaeoglobus fulgidus (from 
SEQ ID NO: 12), from Methanococcus janashii (from SEQ ID NO: 13), from Pyrococcus 
horikoschii (from SEQ ID NO: 14), and from Methanococcus thermoautothrophicus (from SEQ 
ID NO: 15). 

Fig. 5 

Fig. 5 shows an alignment of a conserved region of the coupling subunit, and the 
consensus sequences derived therefrom. The following genes are shown: PfuORF2, 
DPD2_HUMAN,AF1790 and MJ0702. The SEQ ID NOs may be found in FigJ. 

Fig. 6 

Fig. 6 shows an alignment of a conserved region of the coupling subunit, and the 
consensus sequences derived therefrom. The following genes are shown: ACl 1_HUMAN, 
AF2060, MTH 0241, PHBN012 and MJ1422. The SEQ ID NOs may be found in Fig.l. 

Fig. 7 
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Fig. 7 shows an alignment of a conserved region of the sliding clamp loader 2, and the 
consensus sequences derived therefrom. The following genes are shown: ACIS HUMAN, 
MJ0884, AF1195, MTH0240 and MTH0240. The SEQ ID NOs may be found in Fig.l. 

Fig. 8 

Fig. 8 shows an alignment of a conserved region of the elongation protein 1, and the 
consensus sequences derived therefrom. The following genes are shown: DPOD_HUMAN, 
MJO0885, MTH1208, PHBT047 and DPOL_ARCFU. The SEQ ID NOs maybe found in Fig.l. 

Fig. 9 

Fig. 9 shows an alignment of a conserved region of the elongation protein 2, and the 
consensus sequences derived therefrom. The following genes are shown: AF1722, MJ1630, 
PfuORF3, MTH1536 and PHBN021. The SEQ ID NOs maybe found in Fig.l. 

Fig. 10 

Fig. 10 shows an alignment of a conserved region of the Eubacteria-derived elongation 
protein, and the consensus sequences derived therefrom. The following genes are shown: 
DP3A_ECOLI: DNA Pol III, alpha subunit, Escherichia coli, BB0579: DNA Pol III, alpha 
subunit, Borrelia burgdorferi, DP3 A_HELPY: DNA Pol III, alpha subunit, Helicobacter pylori 
AA50: Aquifex aeolicus, section 50 and DP3A_SALTY: DNA Pol III, alpha subunit, Salmonella 
typhimurium). 

Fig. 11 

Fig. 10 shows an aUgnment of a conserved region of the Eubacteria-derived sliding 
clamp, and the consensus sequences derived therefrom. The following genes are shown: 
AAPOL3B, DP3B_ECOLI, S.TYPHIM, DP3B_PROMI, DP3B_PSEPU and DP3B_STRCO 
(AAP0L3B: Aqufex Aeolicus section 93: DP3B_ECOLI: DNA Pol III, beta chain, Escherichia 
coli S.TYPHIM: DNA Pol III, beta chain, Salmonella typhimurium P3B_PROMI: DNA Pol III, 
beta chain, Probeus mirabilis DP3B_PSEPU: DNA Pol III, beta chain, Pseudomonas putida 
DP3B_STRCO: DNA Pol III, beta chain, Strptomyces coelicolor). 

Fig. 12 

Multiple alignment of the sliding clamp protein sequence for generating the Hidden 
Markov Model. 

Fig. 13 

Multiple alignment of the eubacterial sliding clamp protein sequence for generating the 
Hidden Markov Model (AAPOL3B: Aqufex Aeolicus section 93: DP3B_ECOLI: DNA Pol III, 
beta chain, Escherichia coli S.TYPHIM: DNA Pol III, beta chain, Salmonella typhimurixim 
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PSB PROMI: DNA Pol III, beta chain, Probeus mirabilis DPSB PSEPU: DNA Pol III, beta 
chain, Pseudomonas putida DP3B_STRCO: DNA Pol III, beta chain, Strptomyces coelicolor). 

Fig. 14 

Multiple alignment of the sliding clamp loader 1 protein sequences for generating the 
Hidden Markov Model. 

Fig. 15 

Multiple alignment of the sliding clamp loader 2 protein sequences for generating the 
Hidden Markov Model. 



Fig. 16 

Multiple alignment of the protein sequences of the coupling subunits for generating the 
Hidden Markov Model. 

Fig. 17 

Multiple alignment of the elongation protein 1 sequences for generating the Hidden 
Markov Model. 

Fig. 18 

Multiple alignment of the elongation protein 2 sequences for generating the Hidden 
Markov Model. 

Fig. 19 

Multiple alignment of the eubacterial elongation protein sequences for generating the 
Hidden Markov Model. The following genes are shown: 

DP3 A_ECOLI: DNA Pol III, alpha subunit, Escherichia coli, BB0579: DNA Pol III, alpha 
subunit, Borrelia burgdorferi, DP3 A_HELPY: DNA Pol III, alpha subunit, Helicobacter pylori 
AA50: Aquifex aeolicus, section 50 and DP3A_SALTY: DNA Pol III, alpha subunit, Salmonella 
typhimurium). 

Example 

Using known methods, the DNA is purified fi-om the Archaeoglobus flilgidus organism 
(DSM No. 4304). Here organisms are cultured by means of the DSM (German Collection of 
Microorganisms). To clone the appropriate genes (sliding clamp loader I/II, sliding clamp, 
elongation proteins I/II, coupling subimit) in the expression vector pjrc99, primers for each gene 
are developed which encompass the fiill open reading frame, as well as start and stop codon. In 
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the process, restriction ends are added to the primers, which facilitate directed cloning in the 
expression vector. Using approximately 200 ng of total-genomic DNA, PGR reactions are 
driven by suitable annealing temperatures (approximately 35 cycles), and the resulting product is 
purified. After purification, the products are treated with restriction enzymes and purified via an 
agarose gel to be readied for ligation. Using restriction enzymes, the expression vector is 
linearized, purified and attenuated in such a way as to prepare it for ligation with the amplicants 
of the accessory gene fi:*om the aforementioned PGR. Ligation reaction is prepared and 
following incubation an aliquot is transformed in the E. coli strain INValphaF' (Invitrogen). 
Three positive colonies are picked fi-om each gene, plasmid DNA is prepared, and the inserts are 
checked via DNA sequencing for completeness and accuracy. Gorrect clones are picked, then 
recovered once again for singling out on agar plates (ampicillin). Golonies are then picked and 
overnight cultures prepared. An aliquot (500|al) of the overnight culture is placed in a 1 to 51 
culture of LB (Ampicilin: 80- mg/1). The cultures grow until they reach an OD66O of 0.8 at 37^G, 
at which point IPTG (125 mg/1) is added for induction. The cultures continue to grow for an 
additional 1 1 hoxirs. The cultures are then centrifiiged and the pellets are absorbed in a buffer 
(Buffer A: 50 mM Tris-HGL pH 7.9, 50 mM dextrose, 1 mM EDTA). Following centrifugation, 
the cells are re-absorbed, but now Buffer A also contains Lysozyme (4 mg/ml). Following 
incubation (15 min), an equal volume of Buffer B is added (B: 10 mM Tris-HGL pH 7.9, 50 mM 
KGl, 1 mM EDTA, 1 mM PMSF, 0.5 % Tween 20, 0.5% Nonidet P40) and the lyse is added for 
incubation at 75^G for one hour. Following centrifugation, the supernatant is removed and the 
overexpressed proteins are precipitated out using (NH4)2S04. Following centrifugation, the 
pellets are collected and the proteins are re-suspended with Buffer A. The re-suspended proteins 
are dialyzed against storage buffer (50 mM Tris-HGL pH 7.9, 50 mM KGl, 0.1 mM EDTA, 1. 
mM DTT, 0.5 mM PMSF, 50% glycerol) and subsequently stored at -70**G. 

To test the activity of the proteins, reactions are composed as follows: Aliquots of the 
proteins are combined in different configurations and molarities, sliding clamp loader I/II with 
sliding clamp, coupling subunit and elongation protein I, or sliding clamp loader I/II with sliding 
clamp, with and without coupling subunit and elongation protein II, or sliding clamp and 
elongation protein I or II, and finally just elongation protein I or II; the buffer is the 
aforementioned storage buffer. DNA polymerase activity is measured by incorporating (methyl- 
^H) TTP in trichloric acid-insoluble material (Ishino, Y., Iwasaki, H., Fukui, H., Mineno, J., 
Kato, L, & Schinigawa, H. (1992) Biochimie 74:131-136). To determine processivity the 
aforementioned protein mixtures are used in primer-elongation experiments. A Ml 3 single- 
stranded template is inmiersed in 10 mM Tris-HGl (pH 9.4) and heated (92^G) together with a 
universal primer (labeled 5'-FITG), then cooled (room temperature). Attenuation series of the 
template-primer mix thus generated are concentrated in a reaction consisting of nucleotides 
(approximately 200 ^m to 1 mM), reaction buffer (final concentration: 50 mM KGl, 10 mM Tris- 
HGl ) pH 8.3), 1.5-5 mM MgGb, ATP (0 mM-200 mM) and protein stabilizing agents, and 
incubated for 10 minutes at 37°G, 52°G, 62^G, 68**G, 74^G, 78°G. One aliquot is loaded for 
analysis on an automatic sequencer (e.g. Alf, Pharmacie Biotech). The aforementioned protein 
mixtures are also used to measure fidelity and exonuclease activity, in which the method 
described in Kohler et al. is used (Proc. Nati. Acad. Sci. USA 88:7958-7962 (1991) or Ghase et 
al. (J. Biol. Ghem., 249: 4545-4552 (1972). Said protein mixtures are also used in PGR 
(Methods in Molecular Biology, Vol. 15, Humana Press, Totowa, New Jersey, 1993, edited by 
Bruce A. White). 
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Claims 



1 . Thermostable in vitro complex for template-dependent elongation of nucleic acids, 
comprising a thermostable sliding clamp protein which is connected with an elongation protein 
that exhibits polymerase activity. 

2. Thermostable complex according to claim 1, characterized in that the sliding clamp 
protein and the elongation protein are connected via a coupling protein. 

3. Thermostable complex according to claim 1 or 2, characterized in that the sliding clamp 
protein and the elongation protein are derived from archaebacteria. 

4. Thermostable complex according to claim 1 or 2 or 3, characterized in that said sliding 
clamp protein has a ring-like structure the encircles entirely or in part the template-nucleic acid 
strands. 

5. Thermostable complex according to any of claims 1-4, characterized in that the sliding 
clamp protein has one or both of the following consensus sequences: 

[G A V L I M P F W]-D-X-X-X-[G A V L I M P F W]-X-X-[G A V L I M P F W]-X-[G A V L I 
M P F W]-X-[G A V L I M P F W]-X-X-X-X-F-X-X-Y-X-X-D and/or 
[G A V L I M P F W]-X(3)-L-A-P-[K R H D E]-[G A V L I M P F W]-E. 

6. Thermostable complex according to any of claims 1-5, characterized in that the sliding 
clamp protein exhibits in a sequence alignment to the human (eukaryotes) PCNA amino acid 
sequence (SEQ ID NO: 11) having a length of at least 100 amino acids at least a 20% sequence 
identity, and/or the sliding clamp protein exhibits in a sequence alignment to the bacterial P- 
clamp sequence derived from E. coli (eubacteria) (SEQ ID NO: 35) having a length of at least 
100 amino acids at least a 20% sequence identity, and/or the sliding clamp protein exhibits in a 
sequence alignment to the amino acid sequence of the PCNA homologs derived from 
Archaeoglobus Fulgidus (Archaea) (SEQ ID NO: 12) having a length of at least 100 amino acids 
at least a 20% sequence identity. 

7. Thermostable complex according to any of the preceding claims, characterized in that the 
sliding clamp protein with a Hidden Markov Model generated from an alignment in Fig. 12 
yields a score of at least 20, and/or in that the sliding clamp protein with a Hidden Markov 
Model generated from an alignment in Fig. 13 yields a score of at least 25. 

8. Thermostable complex according to any of the preceding claims, characterized in that the 
sliding clamp protein is selected from Archaeoglobus Fulgidus-derived AF0335, from 
Methanococcus jaimaschii-derived MJ0247, from Pyrococcus horikoschii-derived PHLA 008, 
from Methanobacterium thermoautotrophicus-derived MTH1312 and from Aquifex aeolicus- 
derived AE000761_7 

9. Thermostable complex according to any of the preceding claims, characterized in that the 
elongation protein exhibit 5*-3 -polymerase activity or reverse transcriptase activity. 
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10. Thermostable complex according to claim 9, characterized in that the elongation protein 
contains at least one of the following consensus sequences and deviates from said sequence at no 
more than four positions: 

SEQ ID NO: 44 

D-[G A V L I M P F W]-[G A V L I M P F W]-X-X-Y-N-X-X-X-F-D-X-P-Y-[G A V L I M P F 
W]-X-X-R-A 

SEQ ID NO: 45 

A-[G A V L I M P F W]-R-T-A[G A V L I M P F W]-A-[G A V L I M P F W].[G A V L I M P 
F W]-T-E-G-[G A V L I M P F WJ-V-X-A-P^G A V L I M P F W]-E-G-I-A-X-V-[K R H D E]- 
I. 

SEQ ID NO: 46 

[G A V L I M P F W]-P-V-G-[G A V L I M P F W]-G-R-G-S-X-[G A V L I M P F W]-G-S-[G 
A V L I M P F W]-V-A-X-A-[G A V L I M P F W]-X-I-T-D-[G A V L I M P F W]-D.P-[G A V 
L I M P F W]-X-X-X-[G A V L I M P F W]-L-F-E-R-F-L-N-P-E-R-[G A V L I M P F W]-S-- 
M-P-D. 

1 1 . Thermostable complex according to claim 9, characterized in that the elongation protein 
exhibits in a sequence alignment to the human (eukaryotic) amino acid sequence (SEQ ID NO: 
22) having a length of at least 200 amino acids at least a 20% sequence identity and/or in a 
sequence alignment to the archaeabacterial amino acid sequence (SEQ ID NO: 27) having a 
length of at least 400 amino acids at least a 25% sequence identity and/or in a sequence 
alignment to the eubacterial amino acid sequence (SEQ ID NO: 37) having a length of at least 
300 amino acids at least a 25% sequence identity. 

12. Thermostable complex according to claim 9, characterized in that the elongation protein 
with a Hidden Markov Model generated from an alignment in Fig. 17 yields a score of at least 
20, and/or in that the sliding clamp protein with a Hidden Markov Model generated from an 
alignment in Fig. 1 8 yields a score of at least 35 arid/or in that the sliding clamp protein with a 
Hidden Markov Model generated from an alignment in Fig. 19 yields a score of at least 20. 

13. Thermostable complex according to any of the preceding claims, characterized in that the 
elongation protein is selected from Archaeoglobus Fulgidus-derived AF0497 or API 722, from 
Methanococcus jannaschii-derived MJ0885 or MJ163, from Pyrococcus horikoschii-derived 
PHBT047 or PHBN021, from Methanobacterium thermoautotrophicus-derived MTH1208 or 
MTH1536 and from Pyrocuccus ftiriosus-derived PFU0RF3. 

14. Thermostable complex according to any of the preceding claims, characterized in that the 
coupling protein contains the following consensus sequence and deviates from said sequence at 
no more than four positions: 

(SEQ ID NO:43) 

[FL]-G A V L I M P F W]-X-X-[G A V L I M P F W]-X.G-X(13)-[G A V L I M P F W]-X- 
[YR]-[G A V L I M P F W]-X-[G A V L I M P F W]-A-G-[DN]-[G A V L I M P F W]-[G A V 
LIMPFWHDS]. 
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15. Thermostable complex according to any of the preceding claims, characterized in that the 
coupling protein exhibits in a sequence alignment to the human (eukaryotic) amino acid 
sequence (SEQ ID NO: 16) having a length of at least 150 amino acids at least a 18% sequence 
identity. 

16. Thermostable complex according to any of the preceding claims, characterized in that the 
coupling protein with a Hidden Markow Model generated from an alignment from Fig. 16 yields 
a score of at least 10. 

17. Thermostable complex according to any of the preceding claims, characterized in that the 
coupling protein is selected from Archaeoglobus Fulgidus-derived AF179G, from 
Methanococcus jannaschii-derived MJ0702, from Pyrococcus horikoschii-derived PHBN023, 
from Methanobacterium thermoautotrophicus-derived MTH1405 and from Pyrococcus fiiriosus- 
derived PFUORF2. 

18. Thermostable complex according to any of the preceding claims, characterized in that the 
complex is associated with a protein that functions as a sliding clamp loader. 

1 9. Thermostable complex according to any of the preceding claims, characterized in that the 
presence of said complex is associated with ATP or another cofactor. 

20. Thermostable accessory in vitro complex, characterized in that it contains a sliding clamp 
protein and a coupling proteins as defined, respectively, in any of the preceding claims. 

21 . Recombinant DNA-sequence, characterized in that it encodes for a thermostable in vitro 
complex according to any of claims 1 to 19 or for a thermostable accessory complex according to 
claim 20. 

22. Vector, characterized in that it contains a recombinant DNA sequence that encodes for 
sliding clamp protein and a coupling protein and/or an elongation protein. 

23. Vector according to claim 22, characterized in that it also contains suitable restriction 
interfaces for inserting additional DNA sequences, so arranged that a fusion protein is produced 
from the sliding clamp protein and the expression product of the additional DNA sequences. 

24. Vector according to claim 22 or 23, characterized in that it contains suitable promoter 
and/or operator regions which control expression of the DNA sequence(s) 

25. Vector according to claim 24, characterized in that it contains multiple promoter and/or 
operator regions for separate expression of multiple DNA sequences. 

26. Vector according to any of claims 22 to 25, characterized in that it contains repressible 
and inducible promoter and/or operator regions. 
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27. Vector according to any of claims 22 to 26, characterized in that it contains a DNA 
sequence in accordance with claim 21 . 

28. Host cell, characterized in that it is transformed with one or more vectors according to 
any of claims 22 to 27. 

29. Method for producing a thermostable in vitro complex according to any of claims 1 to 19, 
or a thermostable, accessory in vitro complex according to claim 20, characterized in that a 
suitable recombinant DNA sequence according to claim 21 or one or more corresponding vectors 
according to any of claims 22 to 27 are imparted in a host cell, causing the proteins to be 
expressed, said expressed proteins are then isolated from culture medium or after opening of the 
cell, and optionally coupled with additional components of the complex. 

30. Use of a theremostable, accessory in vitro complex according to claim 20 for producing a 
thermostable in vitro complex according to any of claims 1 to 19, characterized in that the 
accessory complex is coupled with an elongation protein exhibiting polymerase activity. 

3 1 . Method of template-dependent elongation of nucleic acids, in which the nucleic acids are, 
if necessary, denatured, provided with at least one primer under hybridizing conditions, in which 
said primer is complementary to a flanking region of a desired nucleic acid sequence of the 
template strand, and in which primer elongation occurs with the help of a polymerase in the 
presence of nucleotides, characterized in that the polymerase used is a thermostable in vitro 
complex according to any of claims 1 to 19. 

32. Method according to claim 3 1 , characterized in that two primers flanking the desired 
nucleic acid and deoxynucleotides and/or derivatives thereof and/or ribonucleotides or 
derivatives thereof are used for amplifying DNA sequences. 

33. Method according to claim 32, characterized in that a polymerase chain reaction is 
carried out. 

34. Method according to claim 3 1 , characterized in that for reverse transcription of RNA to 
DNA a thermostable in vitro complex according to any of claims 1 to 19 is used, the elongation 
protein of which exhibits reverse-transcriptase activity. 

35. Method according to any of claims 3 1 to 34, characterized in that sequencing of nucleic 
acids initiated by a primer which is complementary to a region adjacent to the nucleic acid being 
sequenced is carried out by way of a template-dependent elongation or reverse transcription 
using deoxynucleotides and dideoxynucleotides or derivatives thereof in accordance with the 
Sanger method. 

36. Method according to any of claims 3 1 to 35, characterized in that labels are inserted 
during elongation of the nucleic acids. 
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37. Method according to claim 36, characterized by the use of labeled primers and/or labeled 
deoxynucleotides and/or derivatives thereof and/or labeled dideoxynucleotides and/or derivatives 
thereof and/or labeled ribonucleotides and or derivatives thereof. 

38. Method for labeling nucleic acids by producing single breaks in phosphodi ester bonds of 
the nucleic acid chain and with the help of a polymerase, replacing the nucleotide at the break 
sites with a labeled nucleotide, characterized in that the polymerase used is a thermostable in 
vitro complex according to any of claims 1 to 19. 

39. Reagent kit for elongation and/or amplification and/or reverse transcription and/or 
sequencing and/or labeling of nucleic acids, containing in a single or in multiple containers 

a) a thermostable in vitro complex according to any of claims 1 to 19 or 

b) a thermostable accessory in vitro complex according to claim 20 and optionally, a separate 
elongation protein exhibiting polymerase activity, 

and, optionally, primers, buffer substances, nucleotides, ATP, one or more cofactors and/or 
pyrophosphate. 

40. Kit according to claim 39, characterized in that for the purpose of nucleic acid 
amplification it contains in addition to substances a) or b), which exhibit 5 '-3 '-polymerase 
activity, deoxynucleotides and/or derivatives thereof 

41 . Kit according to claim 39, characterized in that for the purpose of reverse transcription it 
contains substances a) or b), which exhibit 5 '-3 '-reverse transcriptase activity, as well as 
deoxynucleotides and/or derivatives thereof 

42. Kit according to any of claims 39 to 41, characterized in that for the purpose of 
sequencing it contains in addition to deoxynucleotides or ribonucleotides and/or derivatives 
thereof, dideoxynucleotides and/or derivatives thereof. 

43. Kit according to any of claims 39 to 42, characterized in that it contains primers and/or 
deoxynucleotides and/or dideoxynucleotides in labeled form. 
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Beschreibung 

Die Erfindung betrifift einen thermostabilen in vitro-Komplex zur Template-abhangigen Elongation von Nukleinsau- 
ren, einen thermostabilen prokaryontischen akzessorischen in vitro Komplex sowie dafiir codierende DNA-Sequenzen 
5 und Vektoren. Die Erfindung betrifft weiterhin die Verwendung der erfindungsgemaBen Komplexe in Verfahren zur Tem- 
plate-abhangigen Elongation von Nukleins^uren, wie PCR-Reaktionen oder der DNA Sequenzierung, bei denen in vitro 
Template-abhangige DNA Strangsynthese erfolgt. SchlieBlich betrifft die Erfindung noch Reagenzienkits zur Durchfiih- 
rung der erfindungsgemaBen Verfahren. 

DNA Polymerasen.gehoren zu einer Gruppe von Enzymen, die einzelstrangige DNA als Template liir die Synthese ei- 
10 nes komplementaren DNA Stranges verwenden. Diese Enzyme spielen eine bedeutende Rolle im Nukleinsaurestoff- 
wechsel, einschiicBlich der Prozesse DNA Replikation, Reparatur und Rekombination. DNA Polymerasen warden in al- 
ien zellularen Organismen identifiziert, von bakteriellen bis zu menschlichen Zellen, in vielen Viren sowie in Bakterio 
phagen (Komberg, A. & Baker, T. A. (1 991 ) DNA Replikation WH Freeman, New York, NY). Man faBt in der Regel die 
Archaebakterien und die Eubakterien zusainmen zu der Gruppe der Prokaryonten, der Organismen ohne echtem Zell- 
kem, und stellt ihnen die Eukaryonten, die Organismen mit echtem Zellkem, gegenuber. Gemeinsam sind vielen Poly- 
merasen aus den verschiedensten Organismen oftmals Ahnlichkeiten in der Aminosauresequenz sov^e Ahnlichkeiten in 
der Stniktur (Wang, J., Sattar, A.K.M.A.; Wang, CC, Karam, JD., Konigsberg, W.H. & Steitz, TA. (1997) Crystal 
Structure of pol a familily lepUcation DNA polymerase from bacteriophage RB69.CeU 89, 1087-1099). oiganismen wie 
der Mensch besitzen eine \^eizahl von DNA abhangigen Polymerasen, von denen jedoch nicht alle fiir die DNA Repli- 
20 kation zustandig sind, sondem einige auch DNA Reparatur durchfiihren. RepUkative DNA Polymerasen bestehen meist 
aus Protein komplexen mit mehreren Untereinheiten, welche die Chromosomen der zellularen Organismen und Viren re- 
plizieren. Eine generelle Eigenschaft dieser replizierenden Polymerasen ist im allgemeinen eine hohe Prozessivitat, das 
heiBt, deren Fahigkeil, Tausende von Nukleotide zu polymerisieren ohne vom DNA Template abzudissoziieren (Kom- 
berg, A. & Baker, T A. (1 991) DNA Replikation. WH Freeman, New Yo±, NY). 
25 Bis vor kurzem waren die einzigen gut verstandenen, hochprozessiven Replikations-Mechanismen in Zellen solche, 
die von zellularen Replikasen (eineni Protein-Komplex mil Poly merase-Akti vital) vetwendet wurden, sowie der Repb*- 
kaiionsapparat dcs Baktcriophagcn T4 odcr T7. Der dcm Mcchanismus zugrundc licgcn^c Apparat bcinhaltct cin Protein 
mit ringahnlicher Suuktur, eine "Gleitklammer", welche die DNA umschlieBt und die katalytische Polymeraseeinheit - 
das "Elongationsprotein" - an die DNA bindet (Stukenberg, P.T, Studwell-Vaughan, RS.& ODonell. M. (1991) Mecha- 
30 nism of the p-clamp of DNA polymerase m holoenzyme. J. Biol. Chem. 266, 1 1 328-1 1 334; Kuriyan, J. & ODonnel, M 
(1993) Sliding clamps of DNA polymerases. J. MoL BioL 234, 915-925). Die Gleitklammer ist haufig (iber ein ^er 
mehrere weitere Proteine, den "Kopplungsproteinen", an das Elongationsprotein gebunden. Die dreidimensionale Struk- 
tur von verschiedencn Gleitklammerproteinen wurde bereits bestimmt: 

35 - die des eukaryontischen proliferating cell nuclear antigen (PCNA) (Krishna, T.S Jl., Kong, X.-R, Gary, S., Bur- 

gers, R M. & Kuriyan, J. (1994) Crystal suructure of the eukaryotic DNA polymerase processivity factor PCNA. 
CeU 79. 1 233- 1 243; Gulbis, J. M., Kelman, Z., Hurwitz, J., ODonnel, M. & Kuriyan, J. (1 996) Structure of the C- 
termmal region of p21WAFl/CIPl complexed with human PCNA. Cell 87, 297-306), 

- die der p Uniereinheii der Polymerase HI des Eubakteriums Escherichia coH), (Kong, X.-R, Om^st, R., ODon- 
40 nell, M. & Kuriyan, J. ( 1 992) Three dimensiwial structure of the p subunit of Escherichia coli DNA polymerase m 

holoenzyme; a sliding DNA clamp. Cell 69, 425^37) 

- und die des Bakteriophagen T4 Gen45 Proteins (Kelman, Zvi, Hurwitz, J. ODonnel, Mike (1998) Struaure 6 
121 125). ^ 7 , , 

45 Die Gesamtstruktur dieser Gleitklammem ist sehr ahnlich; die Bilder der Proteingesamtstruktur von PCNA, der ^-Un- 
teremheit und der gp45 Ringe sind iibereinandergelegt deckungsgleich (Kehnan, Z. & ODonneU, M. (1995) Structural 
and functional similarities of prokaryotic and eukaryodc sliding clamps. Nucleid Acids Res. 23, 3613-3620). Jeder Ring 
hat vergleichbare Dimensionen und eine zentrale Offiiung, die groB genug ist, um Duplex-DNA, also einen DNA Dop- 
pelstrang, bestehend aus den zwei komplementaren DNA Strangen, zu umschlieBen. 

50 Die Gleitklammer kann sich in vivo nicht selbst um die DNA herum positionieren; sie muB durch einen Klammerlader 
um die DNA fixiert werden. Dieser Klammeriader ist ein Protein komplex, der in Prokaryonten und Eukaryonten aus ei- 
ner Vielzahl von Untereinheiten besteht und der beim Eubakterium Escherichia coli y-Komplex beziehungsweise beim 
Menschen Replikationsfaktor C (RF-C) genannt wird (Kelman, Z. & ODonneU, M. (1994) DNA replicadon - enzymo- 
logy and mechanisms. Curr. Opin. Gent. Dev. 4, 1 85-195). Der Gleitklammerlader erkennt das 3'-Ende des Einzelstrang- 

55 Duplexes (Primer-Template) und positionierl die Gleitklammer in Anwesenheit von ATP um die DNA. Die GleitMam- 
mer, welche die DNA dann umschlieBu wechselwirkl mit Polymerasen, und gewahrleistet so eine schnelle und prozes- 
sive DNA Synthese. 

Im Falle des Bacteriophagen T7 wird das gleiche Ziel, eine prozessivc DNA Synthese, mittels eines strukturcll ande- 
ren Proteinkomplexes erreicht. Der Phage exprimierl eine eigene katalytische Polymerase, die T7 Polymerase, das Pro- 

60 dukt des gene 5, welche mit einem Protein aus dem Wirt Escherichia coli, dem Thioredoxin eine Bindung eingeht und als 
RepUkase eine hochprozessive DNA Replikation ermoglicht (Proc Nad Acad Sci USA 1992 Oct 15; 89(20): 9774-9778 
Genetic analysis of the interaction between bacteriophage T7 DNA polymerase and Escherichia coli thioredoxin, Hima- 
wan JS, Richardson CC). Auch hierbei kommt es zur Klammerbildung, jedoch weist diese Klammer nicht die gleiche 
Struklur auf, wie z. B. im Falle des eukaryontischen PCNA. 

65 Oft ist es noug - wie zum Beispiel im Falle der humanen Polymerase 5 -, daB Proteine (Kopplungsproteine) die Ver- 
bmdung zwischen dem katalydsch akdven Teil der Polymerase und dem Prozessivitatsfaktor (Gleitklammer) schaflfen. 
Beim Menschen ist dies die kleine Untereinheit der 6-Polymerase (Zhang, S.-J., Zeng, X.-R., Zhang, R, Toomey, N.L., 
Chuang, R.Y., Chang, L.-S., and Lee, M.YW.T. (1994). A conserved region in the amino terminus of DNA polymerase 6 
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is involved in iMX)liferating cell nuclear antigen binding. J. Biol. Chem. 270, 7988-7992). Im Falle der T7 Polymerase je- 
doch bindet der Prozessivilatsfaktor die katalyiische Einheil der Polymerase direkL 

DNA Polymerasen werden unter anderem durch zwei Eigenschaften charaklerisierl, ihre Elongaiionsrate, das heifit die 
Anzahl der Nukleotide die sie pro Sekunde in einen wachsenden DNA Strang inkorporieren konnen und ihre Dissoziati- 
onskonstanie. Wenn die Polymerase nach jedem Inkorporationsschritt eines der Nukleotide in die wachsende Ketie wie- 
der vom Strang abdissoziiert (d. h. ein Elongaiionsschritt erfolgt pro Bindungsereignis). dann hat die Prozessivitat den 
Wert 1 und die Polymerase ist nicht prozessiv. Wenn die Polymerase tlir wiederholte Nukleinsaureinkorporationen mit 
dem Strang veibunden bleibt, dann wird der Replikalionsmodus als prozessiv bezeachnet und kann einen Wert von meb- 
reren 'J'ausend eireichen (siehe hi^u auch: Methods in Hnzymology Nfolume 262, DNA Replication, Edited by J L 
Campbell, Academic press 1995, pp. 270-280). 

Fiir die moisten in vitro Anwendungen, wie PGR oder Sequenzierungsprozesse ist Prozessivitat eine wunschenswcrte 
Eigenschaft, die allerdings die bislang in diesen Reakiionen eingesetzten thermostabilen Enzyme nur in geringem MaBe 
hesitzen, wohingegen die temperalursensili ve, mit Thioredoxin assoziierte T7 Polymerase eine Pro/^ssivitat von einigen 
tausendNukleotiden hat Im Vergleich - die thermostabile DNA Polymerase aus Thermus thermophilus oder aqualicus 
haben nur erne Prozessivitat von etwa 50 Nukleotiden (Biochim Biophys Acta 1995 Nov 7; 1264(2): 243-248 Inactiva- 
ticm of the 5-3' exonuclease of Thermus aquaticus DNA polymerase. Merkens LS, Bryan SK, Moses RE) 

Die U.S. Patente 4,683,195, 4,800,195 und 4,683,202 beschreiben die Anwendung solcher thermostabilen DNA Poly- 
merasen m der Polymerase KeUenreaktion (PGR). In der PGR wird unier Venvendung von Primem, Template, Nukleo- 
tiden, einer DNA Polymerase eines entsprechenden Puffers und geeigneten Reaktionsbedingungen DNA neu syntheti- 
siert Hierbei wird die doppelsu^gige Zielsequenz zumeist thermisch aufgeschmolzen. zwei Oligonukleotide werden 
anhybridisiert und die Komplementarsequenz mittels der Inkorporation von Nukleotiden durch die Polymerase am Tem- 
plate synthetisiert Das Extensionsprodukt jedes Primers dient als Template fiir den nachsten Zyklus. In dieser PCR 
kommt es bevorzugt zur Verwradung einer thermostabile Polymerase, welche das zyklische, thermische Aufschmelzen 
der DNA Strange iibersteht So wird haufig Taq DNA Polymerase verwendet (U.S. PatCHt 4,965,188). Die Prozessivitat 
der-Taq DNA Polymerase ist jedoch wie oben ausgefiihrl reiativ gering im Vergleich 2airT7 Polymerase. 

DNA Polymerasen flnden auch in der DNA Sequenzbcsliuunung Anwendung (Salter et aL, Proc. Nad. Acad. Sci., 
USA 74: 5463-5467 (1997)). Haufig wird bci der Scqucnzicrung nach Sanger cine T7 DNA Polymerase vcrwcndct (TV 
bor, S. und Richardson, C.C. Proc. NaU. Acad. Sci., USA 86: 4076-4080 (1989)).-Spater wurde das Cycle-Sequencing- 
Verfahren entwickelt (Murray, V. (1989) Nucleic Acids Res. 17. 8889), welches kein einzelstrangiges Template erfordert 
und die IniUierung der Sequenzreaktion mit verhaltnismaBig gmngen Mengen an Template erlaubl. Die hierbei zur Ver- 
wendung kommenden Polymerasen konnen z. B. die oben erwahnte Taq Polymerase sein (U.S. Patent 5 075 216) oder 
die Polymerase von Thermotoga neapolitana (WO 96/10640) oder andere thermostabile Polvmerasen. Neuere Verfahren 
koppeln die exponentielle Amplifikation und die Sequenzierung eines DNA Fragmentes in einem Schritt, so dafi es mog- 
lich ist, genomische DNA direkt zu sequenzieren. Eines der Verfahren, das sogenannte DEXAS- Verfahren (Nucleic 
Acids Res 1997 May 15; 25(10): 2032-2034 Direct DNA sequence determination from total genomic DNA Kileer C 
Paabo S, Biol Chem 1997 Feb; 378(2): 99-105 Direct exponential ampUficaUon and sequencing (DEXAS) of genomic 
DNA. Kilger C\ Paabo S und DE 196 53 439.9 sowie DE 196 53 494.1) verwendet eine Polymerase mit verminderter 
Disknmimerungsfahigkeit gegeniiber Dideoxynukleotiden (ddNTPs) im Vergleich zu Deoxynukleotiden (dNTPs) sowie 
einen Reaktionspuffer, zwei Primer, die preferentieU nicht equimolar vcMrHegen, und die oben genannten Nukleotide urn 
dann in raehreren Zyklen eine komplette, sequenzspezifische DNA Leiter eines Fragmentes zu erhalten welches von den 
Pnmem umspannl ist. Hne Weiiereniwicklung dieses Verfahrens besteht in der Verwendung eines Polymerasegemi- 
sches, wobei eine der beiden Polymerase zwischen ddNTPs und dNTPs diskriminiert, wahrend die zweite eine vermin- 
derte Diskriminierungsfahigkeit aufweist (Nucleic Acids Res 1997 May 15; 25(10): 2032 2034 Direct DNA sequence 
determination from total genomic DNA. Kilger C, Paabo S). 

DNA Polymerasen finden auch Anwendung in der reversen Transkription von RNA in DNA. Hierbei dient RNA als 
Template und die Polymerase synihetisim einen komplementaren DNA SUrang. Zur Anwendung kommt hier z.B. die 
thermostabile DNA Polymerase aus dem Organismus Thermus thermusphilus (Tth) (U.S. Patent 5,322,770). 

Es kann zudem erwunscht sein, daB die Polymerase eine 'proof-reading' Aktivitat besitzt, also eine 3'-5' Exonuklease- 
aktivitat aufweisu Diese Eigenschaft isi insbesondere dann wiinschenswert, wenn das zu synthetisierende Produkl mit ei- 
ner niedngcn Fehlerrate bei derNukleotidinkorporation hergestelli werden soli. 

Die oben genannten Enzyme, die ublicherweise in PCR-Reakdonen eingesetzt werden, gehoren groBtenteils nicht zu 
den eigenllichen RepUkationsenzymen, sondem es sind zumeist Enzyme, von denen man annimmt, daB sie an der DNA 
Reparatur beteiligt sind, weshalb deren Prozessivitat reiativ gering isL 

Somit war es Aufgabe der vorliegenden Erfindung, mehrere der vorgenannlen Eigenschaften von Polymerasen, insbe- 
sondere hohe Prozessivitat und Thermostabilitat fiir die Verwendung in in viu-o Reaktionen zu vereinen. 

Diese Aufgabe wurde erfindungsgemaB geldst durch die BereitsteUung eines thermostabilen in viiro^kdiiiplexes zur 
Template-abhangigen Elongation von Nukleinsauren, umfassend ein thermostabiles Gleitklammerprotein, welches mit 
emem thermostabilen Polymeraseaktivitat-aufweisenden Elongationsprotein verbunden ist. Dieser Komplex kann in in 
vitro-Reaktionen, wie z. B. in PCR-Reaktionen, eingesetzt werden und weist dabei eine hohe Prozessivitat auf. \brteil ist 
zudem, wenn der Komplex eine geringe Fehlerquote bei derNukleotidinkorporation aufweist, also eine erhohte Fidelity 
hat. Dieser Komplex kann somit bei der Elongation, der Amplifikation und der Sequenzierung von Nukleinsauren einge- 
setzt werden. Dieser Komplex ist vorzugsweise in Siandard-PCR-Reaktionen einsetzbar. 

Urn die Verwendbarkeit eines solchen Komplexes in Standard-PCR-Reaktionen zu ermoglichen, muB eine einfache 
Handhabung gewahrleistet sein. So wurde zwar der Replikationsapparat des Simian virus 40 bereits in vitro zusammen- 
gesetzt (Wage, S. & Siillman, B. (1994) Nature, Vol. 369, 207-221), jedoch fUrSiandard-PCR-ReakUonen ist dieser Re- 
plikationsapparat nicht geeignet, da er unter anderem nicht thermostabil ist. Es ist somit Gegcnstand der Erfindung einen 
thermostabilen in viUro-Komplex mit Polymeraseakli vital bereitzusteUen, der in Standard-PCR-Reakuonen einsetzbar ist 
und der eine hohe Prozessivitat aufweist. 
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Insbesondere ist Gegensland der Erfindung ein Ihennoslabiler prokaryontischer in vitro Komplex zur Elongation von 
Nukleinsauren, der ein thermostabiles Gleitklammerprotein, welches die komplementaren Nukleinsaurestrange ganz 
Oder teilweise umschlieBt, und ein thermostabiles, Polymeraseaktivital aufweisendes Protein umfaBt, wobei dieses Pro- 
tein Oder dieser Proteinkomplex mit dem Gleitklammerprotein gekoppelt ist. 
5 Im Rahmen der vorliegenden Erfindung umfaBt der Begriff Polymeraseaktivitat-aufweisendes Elongationsprotein 
auch Polymeraseaktivitat-aufweisende Protein komplexe oder Untereinheiten solcher Komplexe, welche die Polymera- 
seaktivital tragen. 

Thermostabil im Sinne der vorhegenden Erfindung bedeutet, daB der akzessorische Komplex mit hoher Prozessivitat 
Nukleotide in wachsende Nukleinsaurestrange inkorporiert sowohl bei niedrigen als auch bei hohen 'ifemperaturen, die in 
10 der PCR oder einer anderen Reaktion aufureten, wie z. B. der DNA Sequenzierung. 

Die PGR besteht z. B. in der Regel aus den Schritlen der Denaturierung (70**G bis 98*'G), dem Annealing (40°C bis 
78^C) und der DNA Strangsynthese i6(fC bis 76''G). Somit muB dieser Komplex mindestens zwischen ca. und ca. 
70^C, insbesondere zwischen 60%: und 76°G und hesonders bevorzugterweise funktionsfahig sein zwischen 40°C und 
98°G. Es durfen wahrend der gesamten Reaktion keine irreversiblen Denaturierungserscheinungen des Komplexes oder 
15 einzelner Komponenten auftreten, welche die Elongationsreaktion unterbinden oder inhibieren. 

Die Kopplung zwischen Gleitklammerprotein und Polymeraseaktivital aufweisenderii Elongationsprotein kann durch 
kovalenle, aber auch durch nicht-kovalente Bindung erfolgen. In einer bevorzugten Ausfulirungsform sind Gleitklam- 
merprotein und Elongationsprotein Ober ein Kopplungsprotein verbunden. 

In einer weiteren bevorzugten Ausfuhrungsform der vorliegenden Erfindung stammen im erfindungsgemaBen thermo- 
20 stabilen in vitro Komplex die assoziierten Proteine aus Archaebakterien. Es ist allerdings im Rahmen der vorliegenden 
Erfindung ebenfalls moghch, dafi die assoziierten Proteine aus Eubakterien stammen. (Jegenstand der vorliegenden Er- 
findung ist ebenfalls ein erfindungsgemaBer thermostabiler in vitro Komplex, bei dem die assoziierten Proteine zum Thil 
aus Archaebakterien und zum Teil aus Eubakterien stammen. 

Im Sinne der vorliegenden Erfindung umfaBt der Begriff prokaryontisches Protein sowohl Proteine aus Archaebakte- 
25 rien und Proteine aus Eubakterien. Es ist bekannt, daB der Replikationsapparat in Arcbaea dem des eukaryontischen Re- 
plikalionsappardles ahnlich isl, obwohl die Genomorganisalion in Eukaryonlen und'Archaea ganzlich verschieden isl 
und die zcllularc Struktur der Eubakterien dem dor Archaca ahnclt. (EdgcU, D.R. and D^litllc, WF. (1997). Archaca and 
the origin(s) of DNA replication proteins. GeU 89, 995-998). 

Desweiteren ist eine bevorzugte Ausfuhrungsform der vorliegenden Erfindung ein thermostabiler prokaryontischer in 
30 viu-o Komplex, bei dem ein Polymeraseaktivitat-aufweisender Proteinkomplex vorliegt, der aus einem Kopplungspro- 
tein und einem oder mehreren Polymeraseaktivitai-aufweisenden Elongationsprotein besteht. Bevorzugt ist desweiteren 
em erfindungsgemaBer thermostabiler prokaryontischer in vitro Komplex, bei dem das Gleitklammerprotein dne ring- 
fomiige Struktur aufweist, welches die komplementaren Nukleinsaurestrange ganz oder teilweise umschlieBt 

Die Gleitklammer 



Die folgenden Ausfuhrungen sollen dazu dienen, die Funktion und die moghchen Erseheinungsformen des Gleitklam- 
merproieins besser zu verstehen. 

Das Gleitklammerprotein erfulli die Funktion, die Polymeraseaktivital an die DNA zu binden. Entweder umschlieBt 

40 das Gleitklammerprotein selbst die DNA ganz oder teilweise oder durch Assozialion an das Polymeraseaktivital aufwei- 
sende Protein beziehungsweise an den Polymeraseaktivitat-aufweisenden Proteinkomplex oder seine Uniereinheit wild 
eine Klammer gebildet In jedem FaU wird durch diese Klammerbildung die Prozessivitat signifikant gesteigert, minde- 
stens urn das eineinhalbfache. 

Das heiBl, der erfindungsgemaBe in vitro Komplex besitzt eine mindestens eineinhalbfache Prozessivitat im \fergleich 

45 zum Elongationsprotein alleine, beziehungsweise im Vergleich zu einem Polymeraseaktivital aufweisenden Protein 
kompiex ohne Gleitklammer oder einer Untereinheit davon. 

Als Gleitkammer konnen beispielsweise Homologe des "Proliferating Cell Nuclear Antigen"-Proteinkomplex aus dem 
humanen Genom, oder Homologe des ebenfaUs ringformigen "P-clamp"-Proteinkomplex aus E. coli dienen, welche aus 
thermostabilen Organismen stammen und somit thermostabil sind, oder aus nicht ihermosiabilen Oiganismen slammen, 

50 und nachuraghch durch Veranderung der Aminosauresequenz thermostabil gemacht wurden (Eijsink VG, van der Zee JR, 
van den Burg B,VriendG,Venema G,FEBSLett 1991 Apr 22; 282(1): 13-16, Improving the thermostability of the neu- 
tral protease of Bacillus stearothermophilus by replacing a buried asparagine by leucine, Bertus Van den Burg, Gert 
Vriend, Oene R. Veltman, Gerard Venema, and Vmcent G. H. Eijsink Engineering an enzyme to resist boiling PNAS 
1998, 95: 2056-2060). Dabei kann die Gleitklammer aus mehreren Komponenten aufgebaut sein. Die im menschlichen 

55 Genom identifizierte Gleitklammer besteht aus drei PCNA-Protein Komponenten (SEQ ID NO: 1 1 ) (Homotrimeres), die 
im H. coli Genom identifizierte Gleitklammer besteht aus zwei Komponenten (SEQ ID NO: 35) (Hofriodimeres). 

Als Gleitklammer im Sinne der vorliegenden Erfindung ist hierbei insbesondere jedes Protein zu verstehen, das die 
funktionclle Eigenschaft der Polymeraseprozessivitatssteigerung besitzt oder der Senkung der Fehlerrate dient. Dazu 
kann die Gleitklammer eine ringformige dreidimensionale Struktur aufweisen oder durch Kopplung an ein anderes Pro- 

60 tein ringfonnige dreidimensionale Struktur hilden, durch die sie in der T^ge ist, ein- und doppelstrangige DNA ganz oder 
teilweise zu umschlieBen. 

Als Gleitklammer im Sinne der vorhegenden Erfindung ist insbesondere ein solches Protein zu verstehen, das 

1. zu der menschlichen (Eukaryonlen) PCNA Aminosauresequenz (SEQ ID NO: 11) auf einer Lange von minde- 
65 stens 100 AminosSuren bei einem Sequenzalignment eine mindestens 20%ige Sequenzidentitat aufweist oder das 

2. zu der bakteriellen p-clamp Sequenz aus E. coli (Eubakteria) (SEQ ID NO: 35) auf einer Lange von mindestens 
100 Aminosauren bei einem Sequenzalignment eine mindestens 20%ige Sequenzidentitat aufweist oder das 

3. zu der Aminosauresequenz des PC^JIA Homologen aus Archaeoglobus Fulgidus (Archaea) (SEQ ID NO: 12) auf 



4 



JDOCID: <DE_^ 1 9840771 A1_L> 



10 



15 



20 



30 



DE 198 40 771 A 1 

eino- Lange von mindestens 100 Aminosauren bei einem Sequenzalignment eine mindestens 20%ige Sequenziden- 
tiiai aufweist. 

Die erfindungsgemaBe Gleitkiammer kann eines cxler mehrere der voigenannten Merkmale aufweisen. 

Die in Abb. 1 aufgefuhrten Sequenzidentitalen wurden mil deni BLAST Algorithmus nach Allschui, SJF., Gisb, W 
Miller, W., Myers, E.W., and Lipman, DJ., J. MoL Biol. 215, 403^10 (1990) ermittelt und werden weiier unten erlau" 
lerL 

Als Gleitkiammer im Sinne der vorliegenden Erfindung sind auBerdem Proteine zu verstehen, die eine oder beide der 
folgenden Konsensussequenzen beinhalten und an nicht mehr als vier Positionen von einer dieser Sequenzen abweichen 
(Abb. 4): 

Region 1 
(SEQIDNO: 39) 

[GAVLIMPFW]-I>X-X-X-[GAVLIMPFW]-X-X-(GAVLIMPFW]-X-[GAVLIMPFW].X-fGAVLI 
M P F W]-X-X-X-X-F-X-X-Y-X-X-D und/oder 

Region 2 
(SEQIDNO: 40) 

[G A V L I M P F \V]-X(3)-L-A-P-[K R H D E]-(G A V L I M P F \V].E. 

Die Aminosauren warden hierbei gemafi der Standard lUPAC Einbuchstaben Nomenklatur be^annt und gemafi dem 25 
Prosile Pattern Beschreibungsstandard aufgefuhrt. Dabei sind die folgenden Auiinoi&urcgruppen haufig zusaiimienge- 

G, A, V, L, I, M, P, F Oder W (Aminosauren mit nicht polaren Seitenketten) 
S, T, N, Q, Y, Oder C (Aminosaure mit ungeladenen polaren Seitenketten) 
K, R, H, D Oder E (Aminosaure unit geladenen und polaren Seitenketten) 

AuBerdem bedeutet X in den Sequenzprotokollen jede beliebige Aminosaure oder Insertion oder Deletion. 

Aus dem in Abb. 12 dargestellten multiplen Alignment von menschlichen PCNA Homoiogen wurde zudem ein Hid- 
den Markov Modell generiert Als Gleitklanuner im Sinne der vorliegenden Erfindung ist somit insbesondere jedes Pro- 
tern zu verstehen, da6 mit dem so erzeugten Hidden Markov Modell einen Score mehr als 20 aufweist (Abb. 12). Die 
Hidden Markov Modelle und die entsprechenden Scores wurden mit dem hmmfs Programm (Version 1.8.4, July 1997) 35 
aus dem HMMER Paket berechnet (HMMER Protein and DNA Hidden Markov Models (Version 1.8) von Sean Eddy 
Dept. of Genetics, Washington University School of Medicine, St. Louis, USA). 

Aus dem in Abb. 13 dargesteUten multiplen Alignment von R coli p-clamp Homoiogen wurde ein Hidden Markov 
Modell generiert. Als Gleitklammer im Sinne der vorliegenden Erfindung ist somit insbesondere jedes Protein zu verste- 
hen, das mit dem so erzeugten Hidden Markov Modell einen Score von mehr als 25 aufweist (Abb. 1 3). 40 

Die Gleitkiammer kann aus mehreren Komponenten aufgebaut sein, die durch eine charakieristische Bindung fest an- 
einander gebunden sind, so daB ein stabiler ringformiger Molekiilkomplex gebildet wird, der nicht ohne Weiteres von der 
DNA dissoziieren kann. Dadurch wird eine feste aber nicht kovalente Bindung an die DNA ermoglicht, die freie \fer- 
schiebbarkeit auf derselben ab«- nicht behindert. Die prozessivitatssteigemden Gleitklammerproteine haben zudem cha- 
rakieristische lokale Molekuleigenschaften im Bereich der ^VechseIwirkungsregion zur DNA, welche die freie Ver- 45 
schiebbariceit erleichtem und die durch in diese Region eingelagerte Wassermolekiile unterstiitzt werden kann. 

Eine bevorzugte Ausfuhrungsform der vorliegenden Erfindung ist weiterhin insbesondere ein thermostabiler prokary- 
ontischer in vitro Komplex, wobei das Gleitklanunerprotein eines der folgenden ist: AF0335 aus Archaeoglobus Fulgi- 
dus, MJ0247 aus Methanococcus Jannaschii, PHLA008 aus Pyrococcus Horikoschii, MTH1312 aus Meihanobacterium 
Thermoautoorophicus sowie AE000761_7 aus Aquifex Aeolicus. 50 

Insbesondere sind thermostabile prokaryontische in viUro Komplexe vom Gegenstand dieser Anmeldung umfaBt, wo- 
bei das Gleitklammerprotein eine Aminosauresequenz gemaB S^QID No. 11, 12, 13, 14. 15 und 36 (Aquifex Aeolicuc) 
aufweist. 

Der Gleitklammerlader 55 

Desweiteren isi als bevorzugte Ausfuhrungsform zu verstehen, wenn zu diesem erfindungsgemaBen Komplex zusatz- 
lich em Gleitklammerlader gehort, welcher die Komponenten der Gleitkiammer urn den ununterbrochenen DNA-Strang 
herum zusammenfiigt, bzw. diese wieder entfemt, wenn die Reaktion beendet wird. Dieser Gleitklammerlader ist bevor- 
zugt mit dem erfindungsgemaBen in vitro-Komplex assoziiert 

Im Menschen besteht der Gleitklammerlader aus funf Uniereinbeiten, 4 kleinen (Gleitklammerlado- 1) und einer gro- 
Ben Untereinheit (Gleitklammerlader 2). Als Gleitklammerlader dienen erfindungsgemaB beispielsweise ein oder meh- 
rere prokaryontische Homologe des im Menschen identifizierten "Replication Factor C"-Proteinkomplexes (Gleitklam- 
merlader 1: SEQ ID NO: 1, 32, 33, 34 und Gleitklammerlader 2 SEQ ID NO: 6). 

Als Gleitklanunerlader 1 im Sinne der vorliegenden Erfindung ist insbesondere ein solches Protein zu verstehen das 
zuder menschlichen (Eukary<Miten) Aminosauresequenz (S^Q ID NO: 1, 32, 33, 34) auf einer Lange von mindestens 
1(X) Aminosauren bei einem Sequenzalignment eine mindestens 20%ige Sequenzidentitai aufweist 

Als (Jleilklammerlader 2 im Sinne der vorliegenden Erfindung ist insbesondere auch ein solches Protein zu verstehen. 
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das zu der menschlichen (Eukaryonten) Aminosauresequenz (SEQ ID NO: 6) auf einer Lange von mindestens 150 Ami- 
nosauren bei einem Sequenzalignment eine mindestens 20%ige Sequenzidentital aufweist. 

Als Klammeriader sind beispielsweise die in Abb. 1 aufgefiihrten Homologe aus Archaebakterien geeignel (SEQ ID 
NO: 3, 4, 5; Homologe zu Gleitklanmierlader 1 und SEQ ID NO: 8, 9, 10, Homologe zu Gleitklammerlader 2). Daher ist 
5 insbesondere bevorzugt ein thennostabiler prokaryotischer in vitro Komplex, wobei zusatzlich ein RFC-homologes Pro- 
tein bzw. Proteinkomplex anwesend ist. 

Als Gleitklammerlader 1 im Sinne der vorliegenden Erfindung ist daher beispielsweise jedes Protein zu verstehen, das 
die folgenden Konsensussequenzen beinhallet und an nicht mehr als vier Positionen von dieser Sequenz abweicht (Abb. 
6): 

10 

SEQ ID NO: 41 

C-N-Y-X-S-[K R H D FJ-T-T-X.[G A V TJ M P F \V]-[G A V TJ M P F W]-Q-S-P-C-X-X-F-R-F-X-P-[G A V T. T M P F 
W]. 

15 

Als Gleitklammerlader 2 im Sinne der vorliegenden Erfindung ist daher beispielsweise auch jedes Protein zu verste- 
hen, das die folgenden Konsensussequenzen beinhallet und an nicht mehr als vier Positionen von dieser Sequenz ab- 
weicht (Abb. 7): 

20 SEQ ID NO: 42 

K-X-X-L-L-X-G-P-P-G"X-G-K-T-[S T N Q Y a-X-[G A V L I M P F W]-X-X-[G A V L I M P F W]. 

Aus dem in Abb. 14 dargestellt^ multiplen Alignment von menschlichen Gleitklammerlader 1 Homologen wurde zu- 
25 dem ein Hidden Maricov Modell generiert. Als Gleitklanunerlader 1 im Sinne der vorliegenden Erfindung ist somit jedes 
Protein zu verstehen, das mil dein so erzeugten Hidden Markov Modell einen Score iliehr als 25 aufweist (Abb. 14). 

Aus dem in Abb. 15 dargcstclltcn multiplen Alignment von menschlichen Glcitklamincrladcr 2 Homologen wurdc zu- 
dem ein Hidden Maricov Modell generiert. Als Gleitklammerlader 2 im Sinne der vorliegenden Erfindung ist somit jedes 
Protein zu verstehen, das mit dem so erzeugten Hidden Markov Modell einen Score mehr als 15 aufweist (Abb. 15). 
30 Bevorzugt isi ebenfalls, wenn ein dem Eubakterium Escherichia coli Y-Komplex homologes Protein als Gleitklammer- 
lader anwesend ist 

AuBerdem bevorzugt ist ein erfindungsgemaBer thennostabiler in vitro Komplex zur Elongation von Nukleinsauren, 
bei dem neben dem Gleitklaimnerlader zusatzlich ATP vorliegen kann, vorzugsweise ebenf alls assoziiert. 

35 Das Kopplungsprotein 

Das Kopplungsprc^ein hat die Funktion, das Elongationsprotein und das Gleitklanunerprotein zu verbinden. Als 
Kopplungsprotein im Sinne der vorliegenden Erfindung ist insbesondere jedes Protein zu verstehen, welches die oben 
beschriebene Funktion besitzt. Dabei sind als Kopplungsproteine beispielsweise die in Abb. 1 aufgefiihrten Homologe 

40 zu der humanen Sequenz der koppekiden Untereinheit (DPD2_HUMAN, [SEQ ID NO: 16]) aus Archaebakterien geeig- 
net (SEQ ID NO: 17, 18, 19, 20, 21). 

Als Kopplungsprotein im Sinne der vorliegenden Erfindung ist insbesondere ein solches Protein zu verstehen, das zu 
der menschlichen (Eukaryonten) Aminosauresequenz (SEQ ID NO: 16) auf einer Liinge von mindestens 150 Aminosau- 
ren bei einem Sequenzalignment eine mindestens 18%ige Sequenzidentital aufweist. 

45 Als Kopplungsprotein im Sinne der vorliegenden Erfindung ist daher jedes Protein zu verstehen. das die folgende 
Konsensussequenz beinhallet und an nicht mehr als vier Positionen von dieser Sequenz abweicht (Abb. 5): 

SEQ ID NO: 43 

50 [FL]- [G A V L I M P F W]-X-X-[G A V L I M P F W]-X-G-X( 1 3)- [G A V L I M P F W]-X-[ YR]-[G A V L I M P F W]- 
X-[G A V L I M P F W]-A-G-[DN]-[G A V L I M P F W]-[G A V L I M P F W]-[DS]. 

Aus dem in Abb. 16 dargestellten multiplen Alignment von Homologen zur menschlichen koppelnden Untereinheit 
wurde zudem ein Hidden Markov Modell generiert. Als koppelnde Untereinheit im Sinne der vorliegenden Erfindung ist 
55 somit jedes Protein zu verstehen, das mit dem so erzeugten Hidden Markov Modell einen Score mehr als 10 aufweist 
(Abb. 16). 

Polymeraseaktivitat-aufweisendes Elongationsprotein bzw. Polymeraseakti vital aufweisender Enzymkomplex bzw. Un- 
tereinheit davon 

60 

Einige Elongationsproteine benoligen die Anwesenheit eines Kopplungsproteins (koppelnde Untereinheit). um iiber- 
haupt Polyroeraseaktivitat aufzuweisen. Es ist aber auch vorstellbar, daB das Elongationsprotein direkt an die Gleitklam- 
mer bindet. Andere Elongauonsproteine benotigen die Anwesenheit eines Kopplungsproteins fiir die Bindung an die 
Gleitklammer. 

65 Das Elongationsprotein weist eine 5'-3'-Polymeraseaktivitat oder eine Reverse-TVanskripiase-Akdvitat auf. Dabei sind 
als Elongationsproteine beispielsweise die in Abb. 1 aufgefiihrten zum menschlichen Elongationsprotein (SEQ ID NO: 
22) Homologe aus Archaebakterien geeignet (SEQ ID NO: 23, 24, 25. 26). 

Bevorzugterweise bindet das Elongationsprotein an das Kopplungsprotein. welches wiederum an das (jleitklammer- 
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protein gekoppell ist 

Als Elongaiionspioiein im Sinne der vorliegenden Erfindung ist insbesondere ein solches Protein zu verstehen, das zu ' 
der menschlichen OEukaryonten) Aminosauresequenz (SEQ ID NO: 22) auf einer Lange von mindcstens 200 Aminosau- 
ren bei einem Sequenzalignment eine mindestens 20%ige Sequenzidentitat aufweist. 

Als Elongationsprorein ina Sinne der vorliegenden Erfindung ist daber insbesondere jedes Prctfein zu verstehen, das die 5 
folgende Konsensussequenz beinhaltet und an nicht mehr als vier Positionen von dieser Sequenz abweicht (Abb. 8): 

SEQ ID NO: 44 

D-[G AVLIMPFW]-[GAVLIMPF W]-X-X-Y-N-X-X-X-F-D-X-P-Y-[G A V L I M P F W]-X-X.R-A. lo 

Aus dem in Abb. 17 dargestellten multiplen Alignment von Horaologen zum menschlichen Elongationsprotein (SEQ 
ID NO: 22) wurde zudem ein Hidden Markov Mode)! generiert. Als Elongationsprotein im Sinne der vorliegenden Er- 
findung ist somit insbesondere jedes Protein zu verstehen, das mit dem so erzeugten Hidden Markov Model] einen Score 
mehr 20 aufweist (Abb. 17). ^ - 

Als Elongationsprotein im Sinne der vorliegenden Erfindung ist insbesondere auch ein solches Protein zu verstehen, 
das zu der archaeabakteriellen Aminosauresequenz (SEQ ID NO: 27) auf einer Lange von mindestens 400 Aminosauren 
bei emem Sequenzalignment eine mindestens 25%ige Sequenzidentitat aufweist. Beispielsweise sind geeignet die aus 
Archaebakterien stanrmienden Proteine mit der SEQ ID NO: 28, 29, 30 oder 31. 

Als Elongationsprotein im Sinne der vorliegenden Erfindung ist daher insbesondere auch jedes Protein zu verstehen, 20 
das die folgende Konsensussequenz beinhaltet und an nicht mehr als vier Positionen von dieser Sequenz abweicht (Abb! 



SEQ ID NO: 45 

A-[G A V LI M P F W]-R-T-A.[G A V LI M P F W]-A-[G A V LI M P F W]-[G A VlU M P FW]-T-E-G-fG A V L I M 
P F UTV-X-A-P-IG A V L I M P F W]-E-G-I-A-X-y-[K R H D E]-! ' 1.1 ivi 

Aus dem in Abb. 18 dargesteUlen multiplen Alignment von Homologen zum archebakteriellen Elongationsprotein 
(SEQ ID NO: 27) wurde zudem ein Hidden Markov ModeU generiert Als Elongationsprotein im Sinne der vorliegenden 
Erfindung 1st somit insbesondere jedes Protein zu verstehen, das mil dem so erzeugten Hidden Markov Modell einen 
Score mehr als 35 aufweist (Abb. 1 8). 

Das Elongationsprotein kann auch eubakteriellen Ursprungs sein. 

Als Elongationsprotein im Sinne der vorhegenden Erfindung ist somit ebenfaUs ein solches Protein zu verstehen das 
zu der eubakteriellen Aminosauresequenz (SEQ ID NO: 37) auf einer Liinge von mindestens 300 Aminosauren hei ei- 
nem Sequenzalignment eine mindestens 25%ige Sequenzidentitat aufweist. 

Als Elongationsprotein im Sinne der vorliegenden Erfindung ist daho: auch insbesondere jedes Protein zu verstehen 
d^ die folgende Konsensussequenz beinhaltet und an nicht mehr als acht Positionen von dieser Sequenz abweicht (Abb! 

SEQ ID NO: 46 

[GAVLIMPFW]-P-V-G-[GAVLIMPFW]-G-R-G-S-X-[GAVLIMPFW]-G-S-[GAVLIMPF\\n^ 
A-[G A V L I M P F W]-X-I-T-D-(G A V L I M P F W].D-P-[G A V L I M P F W]-X-X-X-[G A V L I M P F Wl-L-F-E- 
R-F-L-N-P-E-R-[G A V L I M P F W]-S-M-P-D. 

Aus dem in Abb. 19 dargestellten multiplen AUgnment von Homologen zum eubakteriellen ElongationsiM-otein (SEQ 
ID Na 37) wurde zudem ein Hidden Markov ModeU generiert. Als Elongationsprotein im Sinne der vorliegenden Er- 
findung ist somit jedes Protein zu verstehen, das mit dem so erzeugten Hidden Markov Modell einen Score mehr 20 auf- 
weist (Abb. 19). 

Bisher wurden einige DNA Polymerasen als Elongationsproteine ohne Kopplungsprotein und ohne Gleitklammer fur 
Standard PCR Reaktionen eingesetzt, so z, B. DNA Polymerase I aus Pyrococcus fbriosus (United States Patent No 
5,545,552) oder Pyrococcus species (European Patent Application No: 0 547 359 Al). Diese Enzyme zeichnen sich 
durch die Eigenschaft aus, thOTnostabil zu sein und haufig eine 3'-5' exonuklease Aktivitat ('proof-reading' Aktivitat) zu 
besilzen. Erst vor kurzem wurde ein Heterodimer mit Polymeraseaktivitat in Pyrococcus furiosus entdeckt fUemori T 
Sato, y., Kato, L, Doi, H., and Ishino, Y. (1 997). A novel DNA polymerase in the hyperthermophilic archaeon, pyrococ- 
cus funosus: gene clewing, expression, and characterizauon. Genes to Cells 2, 499-512.) 

Bevorzugt ist im Rahmen der vorliegenden Erfindung, daB der erfindungsgemaBe prokaryontische in viu-o Komplex 
zur Elongation yon Nukleinsauien aus Proteinen bestehl, die aus Archaea stanamen. Das heiBi, bevorzugt ist die \ferwen- 
dung eines Gleitklammeq^roteins aus Archaebakterien. Desweiteren ist bevoiv.ugt, daB das Hlongarionsproiein bezie- 60 
hungsweise ein Polymeraseakdvitat-aufweisender Proteinkomplex, welcher aus Elongationsprotein und Kopplunsspro- 
tem besteht, aus Archaeabakterien stanmiL 

Beispiele solcher fur den erfiiidungsgemaBen Komplex geeigneter Proteine sind in Abb. 1 und 2 dargestellt. 

Natiirlicb ist es moglich, durch Deleiionen oder Mutationen oder durch das Anfugen von Aminosauren die Eigen- 
schafien dieser Proteine zu optimieren. Diese veranderten Proteine sind ebenfaUs Gegensiand der vorliegenden Erfin- 65 
dung solange sie den erfindungsgemaBen in vitro Komplex zur Elongation von Nukleinsauren bilden und die oben naher 
spezifizierten Funktionen erfullen. 

Zur Veranschaulichung des erfindungsgemaBen Komplexes dient Abb. 3, die beispielhaft den Replikationsappaiat in 
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einer moglichen Variante darstellt, wobei die Gleitkammer uber eine koppelnde Untereinheit an das Elongationsprotein 
bindet. 

Deswdteren ist bevorzugt, daB zusatzlich zu dem erfindungsgemaBen prokaryotischen akzessorischen in vitro Kom- 
plex zwei Primer anwesend sind. Primer sind in der Regel Oligonukleotide, welche an die beiden komplementaren DNA 
s Strange der Zielsequenz binden, wobei sie in gegensaizlicher Orientierung, ihre 3'-Enden zueinander gerichtet, den zu 
amplifizierenden Abschnitt einschlieBen. Sie dienen als Startpunkt der Amplifikation und stellen in der Regel ein freies 
3'-0H Ende fiir die Polymerase zur Inkorporation eines Nukleotides zur Verfugung. 

Wahrend der Verwendung des Proteinkomplexes zur Amplifikation, Elongation und Sequenzierung liegt der erfin- 
dungsgemalie Komplex bevorzugterweise in einem geeigneten Puffer vor Geeignete Puffer sind solche, die fur PGR, Se- 

10 quenzierung, Nukleinsauremarkierung und anderen in vitro Nukleinsaure Elongationsreaktionen mittels Polymerase An- 
wendung finden. Geeignete Puffer werden beispielweise beschrieben in Methods in Molecular Biology Vol. 15 Humana 
Press Tolowa, New Jersey, 1993, edited by Bruce A. White. 

Wahrend der Verwendung des Proteinkomplexes zur Elongation, Amplifikation, Reversen TVanskription oder/und Se- 
quenzierung liegt zusatzlich zu dem erfindungsgemaBen Komplex ein Gemisch aus Nukleotiden vor. Deoxynukleotide 

15 konnen aus dGTP, dATP, dTTP und dCTP ausgewahlt werden, sind jedoch nicht auf diese beschrankt. Zusatzlich konnen 
auch Derivate von Deoxynukleotiden gemafi der Erfindung verwendet werden, welche als solche Deoxynukleotide defi- 
niert sind, die in der Lage sind, durch eine thermostabiie DNA Polymerase in wachsende DNA Molekule inkorporiert zu 
werden, die in der Reaktion synthetisiert werden. Solche Derivate scblieBen Thionukleotide, 7-deaza-2 -dGTP, 7-deaza- 
2'-dATP sowie Deoxyinosine Triphosphat, das auch als Ersatz Deoxynukleotid fiir dATP, dGTP, dTTP oder dCTP ver- 

20 wendet werden kann, ein, sind aber nicht auf diese beschrankt. Ebenso konnen markierte Deoxynukleotide verwendet 
werden. AUe bekannten oder/und zu dem erfindungsgemaBen Zweck geeigneten Markierungen konnen dabei vorliegen. 

Dideoxynukleotide konnen aus ddGTP, ddATP, ddTTP und ddCTP ausgewahlt werden, sind jedoch nicht auf diese be- 
schrankt. Zusatzlich konnen auch Derivate von Dideoxynukleotiden gemaB der Erfindung verwendet werden, die als sol- 
che Dideoxynukleotide definiert sind, die in der Lage sind, durch eine thermostabiie DNA Polymerase in wachsende 

25 DNA Molekule inkorporiert zu werden, die in der Reaktion synthetisiert werden. Solche Derivate konnen radioakti ve Di- 
deoxynuUeolide (ddATP, ddGTP, ddTTP und ddCTP) oder Dideoxynukleotide (ddJVJP, ddGTP, ddTTP und ddCTP), 
wclchc mit z. B. FTTC, Cy5, Cy5.5, Cy7 und Tcxas-Rol oder andcrcn markicrt sind, cihschlicBcn, sind abcr nicht auf 
diese beschrankt. Im Rahmen einer erfindungsgemaBen Sequenzierung konnen auch markierte Deoxynukleotide zusam- 
men mit unmarkierten Dideoxynukleotiden eingesetzi werden, 

30 Ribonukleotide konnen aus GTP, ATP, TTP und CTP ausgewahlt werden, sind jedoch nicht auf diese beschrankt. Zu- 
satzlich konnen auch Derivate von Ribonukleotiden gemaB der Erfindung verwendet werden, die als solche Ribonukleo- 
tide definiert sind, die in der Lage sind, durch eine thermostabiie DNA Polymerase in wachsende DNA Molekule inkor- 
poriert zu werden, die in der Reaktion synthetisiert werden. Solche Derivate konnen radioaktive Ribonukleotide (ATP, 
GTP, TTP und CTP) oder Ribonukleotide (ATP, GTP, TTP und CTP), welche mit z. B. FTTC, Cy5, Cy5.5, Cy7 und Te- 

35 xas-Rot Oder andeien maridert sind, einschlieBen, sind aber nicht auf diese beschrankt. 

Wahrend der Verwendung des Proteinkomplexes zur Amplifikation, EltMigation und Sequenzierung kann es sich als 
vorteilhaft erweisen wenn bei der Reaktion eine Pyrophosphatase anwesend ist 

Ein weiterer Gegenstand der vorliegenden Erfindung ist ein thermostabiler, akzessorischer in vitro-Komplex, welcher 
ein Gleitklamraerprotein und ein Kopplungsprotein umfaBt, wobei beide Proteine wie oben definiert sind. 

40 Der erfindungsgemaBe akzessorische in vitro-KompIex ist gewissermafien als Vorstufe zu dem oben beschriebenen er- 
findungsgemaBen thermostabilen in viUro-Komplex zur Template-abhangigen Elongation von Nukleinsauren zu versie- 
hen. 

Der akzessorische Komplex muB lediglich mit einem Elongationsprotein kombiniert werden, um diese m eine deutlich 
erhohte Prozessivitat zu verleihen. Der akzessorische in vitro-Komplex kann daher auch zusammen mit bekannten ther- 
45 mostabilen Polymerasen eingesetzt werden, wobei ebenfalls Nachteile dieser bekannten Proteine hinsichtlich insbeson- 
dere der Prozessivitat verringert werden konnen. 

Identifizierung der Gene, Klonierung der Gene, Expression dieser und Reinigung der Proteine der erfindungsgemaBen 
in vitro- Komplexe: 

In der Regel konnen die Komplexe, bestehend aus rekombinanten Proieinen, mit den folgenden Schritien berdtgestellt 
50 werden: Bereitstellung des Nukleinsaurefragmentes welches fiir das gewunschte Protein kodiert. Ligation in einen Ex- 
pressionsvektor. Transformation in einen Wirt, Expression und Aufreinigung des Proteins, In Ubereinstimmung mit der 
vorliegenden Erfindung kann es vorkommen, daB Gene, insbesondere aus den Archaebakterien, Inteine (Proc Natl Acad 
Sci USA 1992 Jun 15; 89(12): 5577-5581, Intervening sequences in an Archaea DNA polymerase gene, Perler FB, 
Comb DG, Jack WE, Moran LS, Qiang B, Kucera RB, Benner J, Slatko BE, Nwankwo DO, Hempstead SK, et al) ent- 
55 halten konnen, welche zunachst entfemt werden konnen. 

Die Identifizierung weiterer fiir den erfindungsgemaBen Komplex geeigneten Proteine kann z. B. erfolgen durch Ho- 
mologiesucheh in Datenbanken, welche Genome aus Prokaryonten umfassen. Einige Programme sind hierfiir geeignet, 
beispielsweise das Programm BLASTP und FASTA (Altschul, Stephen F., Thomas L Madden, Alejandro A. Schaffer,' 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new ge- 
60 neration of protein database search programs". Nucleic Acids Res. 25: 3389-3402. WR Pearson & D J T inman PNAS 
(1988)85:2444-2448). 

Ke Identifizierung kann auch dadurch geschehen, da6 DNA Sonden verwendet werden, um in z. B. gesamtgenom- 
ischen Banken aus Prokaryonten nach den entsprechenden Genen zu screenen. Die hierfiir notigen experimentellen Ver- 
fahren finden sich in Maniatis et al. (Molecular Cloning: A laboratory Manual, Cold Spring Harbor Laboratory Press 
65 N.Y.(1989)). ^ 

Die Bereitstellung der gereinigten Nukleinsaure der Gene der erfindungsgemaBen Komplexe kann z. B. uber deren 
Isolierung aus einer genomischen Bank des relevanten Organismus geschehen oder durch synthetische DNA-Herstel- 
lung, jeweils gewiinschtenfalls kombiniert mit einer Amplifikation mittels K:r, unter Zuhilfenahme von Primem, wel- 
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che fiir den gewunschten Genabschniit sperifisch sind. Ubliche Verfahren sind in Maniatis et al. (Molecular Cloning: A 
laboratory Manual, Cold Spring Harbor Laboratory Press, N.Y (1989)) beschrieben. 

Die Gene der Proteine der crfindungsgemaBen in vilro-Komplexe konnen in einer Vielzahl von Verfahren klonicrt 
werden und somit mittels eines Expressionsvektors zur Proteinexpression in einem Wirtsorganismus zur Verfiigung ge- 
stellt werden. Gangige Verfahren sind in Maniatis el al. (Molecular Cloning: A laboratory Manual, Cold Spring Harbor 5 
Laboratory Press, N.Y.(1989)) beschrieben. Die Gene der Komplexe konnen hierbei z. B. zunachst in einen liigh-copy* 
Vektor, z. B. pUClS, pBst oder pBR322 kloniert werden und dann erst in einen prokaryontischen Expressionsvektor, 
z. B. pTrc99 umkloniert werden, oder aber direkt in einen prokaryontischen Expressionsvektor kloniert werden. Unier 
Vektoren sind hierbei Nukleinsauren zu verstehen, die in der Lage sind, ein andenes Nukleinsauremolekul in oder zwi- 
schen verschiedenen Organismen. bzw. genetischen Hintergrunden zu uransportieren. Sie haben in der Regel die Fahig- lo 
keit der autonomen Replikation und/oder der Expression (Cxpressionsvekloren) des operativ verbundenen Nufcleinsau- 
remolekiils. "Operativ verbunden" bedeutet, daB das transportierte Nukleinsauremolekul so mit dem \fektor verbunden 
ist, daB es unter der Transkriptions- und TranslationskonU-oIle von ExpressionskontrolIsequen7£n des Vektors steht und 
in einer Wirtszelle expriniiert werden kann. Bakterielle Expressionssysteme, die bevorzugte Verwendung derer, sowie 
eine Auswahl an Vektorsysiemen ist z. B. in 'Gene Expression Technology', (Melh. EnzymoL, Vol 185. Goeddel, Ed., 15 
Academic Press, N.Y. (1990)) beschrieben. Geeignete Vektoren fiir die vorliegende Erfindung sollten unterschiedlich 
Starke Expression der Proteine dadurch ermoglichen, daB sie einige oder alle der folgenden Eigenschaften besitzen: (1) 
Promotoren, oder TVanskripiionsiniliationssiellen, entweder unmittelbar neben dem Start des Proteins oder als Fusions- 
protein, (2) Operaioren die verwendet werden konnen Genexpression an oder aus zu schalten, (3) ribosomale Bindungs- 
stellen fur eine verbesserte Translation, und (4) Terminationsstellen fur die Transkription od«- Translation, die zu verbes- 20 
serter Stabilitat fiihren. 

Expressionsvektoren, die mil eukaryontischen Zellen, bevorzugterweise mit Vertebratenzellen kompatibel sind, kon- 
nen auch Verwendung finden. Einige bekannte Vektoren sind pSVL und pKSV-10 (Pharmacia), pBPV-l/pML2d ^nter- 
nationai Biotechnologies, Inc.), und pTDTl (ATCC 31255). Die Verwendung eines retroviralen Expressionsvektors ist 
auchmogUdi. - ^- 

Weilere Gegenstande dear vorliegenden Erfindung sind daher DNA-Sequenzen, w^che fur die t^fmdungsgemaBen 
thrcmostabilcn in viUTO-Komplcxc bzw. akzcssorischcn in viUro-Komplcxc codicrcn, sowic cntsprcchcadc \fcktorcn, vor- 
zugsweise Expressionsvektoren. 

Die erfindungsgemaBen Vektoren enthallen mindestens das Gen fur das Gleitklammerprotein und vorzugsweise min- 
destens ein Gen fiir ein Kopplungs- oder/und Elongationsprotein, wie sie jeweils oben definiert sind. 30 

Es ist dabei im Rahmen der Erfindung bevorzugt, wenn der Vektor neben den darin bereits enthaltenen DNA-Sequen- 
zen noch geeignete Resuiktionsschnittstellen und ggf. Polylinker zur Insertion weiierer DNA-Sequenzen enthalL Beson- 
ders bevorzugt ist es, wenn die raumliche Anordnung von bereits enthaltener DNA-Sequenz und zusatzlicher Insertions- 
stelle nach Expression zur Ausbildung eines Fusionsproteins fiihrt. 

Ebenfalls bevorzugt ist es, wenn der erfindungsgemaBe Vektor Promotor- oder/und Operatorbereiche enthalt, wobei es 35 
besonders bevorzugt ist, wenn derartige Promotor- oder/und Operatorbereiche induzierbar oder reprimierbar sind. Da- 
durch wird eine Steuerung der Expression in \Vnrtszellen a-heblich vereinfacht und kann besonders effizient gestaltet 
werden. 

Derartige Promotor/Gperatorberciche konnai in einem Expressionsvektor auch mehrfach vorkommen, so daB eine 
ggf unabhangige Expression mehrerer DNA-Sequenzen unter Verwendung nur eines Expressionsvektors ermogiicht 40 
wird. 

Ein besonders bevorzugter Vektor enthalt im Rahmen der Erfindung DNA-Sequenzen codierende fiir alle Bestandteile 
eines erfindungsgemaBen in vilro-Komplexes oder akzessorischen in vitro-Komplexes. 

Ein weiterer Gegenstand der vorliegenden Erfindung ist eine Wirtszelle, enthallend einen oder mehrere erfindungsge- 
ma6e(n) Vektor(en), wobei in dieser Wuiszelle unter geeigneten Bedingungen die Expression zu Proteinen erfolgen 45 
kann. Geeignete Bedingungen schheBen beispielsweise Anwesenheit eines Induktors oder eines Derepressors ein. 
^ Fur die Transformation, Phageninfektion und Zellkultur existieren StandardprotokoUe in Maniatis et al. (Molecular 
Cloning: A laboratory Manual, Cold Spring Harbor Laboratory Press, N.Y). Aus der Vielzahl der vorhandenen E. coli 
Slanmie, die zur Transformation geeignet sind, sind die bevorzugten JMlOl (ATCC No. 33876), XLl (Su^tagene), RRI 
(ATCC No. 31343) und BL21 (Pharmacia). Proteinexpression kann z. B. auch geschehen unter Zuhilfenahme des E. coli 50 
Stammes INVoF (Inviu-ogen). 

Die Transformanten werden gemaB den geeigneten Wachstumsbedingungen des Wirtsstaiiunes kultivierl. So werden 
die meisten E. coh Stamme z. B. in LB Medium kultiviert, bei 30°C bis 42**C bis zur logarithmischen oder stationaren 
Wachstumsphase. Die Proteine konnen aus einer transfomuerlen Kultur gereinigt wo-den, wobei dies entweder aus ei- 
nem Zellpellet, nach ZenUifugation, oder aber aus der Kulturlosung geschehen kann. Sofem die Proteine aus dem Zell- 55 
pellet gereinigt wo-den, werden die Zellen in einem geeigneten Pulfer resuspendiert und mittels Sonilizierung, enzyma- 
tischer Behandlung oder Einfrieren und Auftauen aufgebrochen. Sofem die Reinigung aus der Kuitursuspension ge- 
schieht, das heiBt alleine oder uber ein Fusionsprotein, wird der Uberstand von den Zellen mittels bekannter Verfahren, 
wie Zenuifugation abgelrennt. 

Die Separation und Reinigung der Proteine der erfindungsgemaBen Komplexe entweder aus dem UberTitand der Kul- 60 
turlosung oder aus dem Zellextrakt kann durch bekannte Separations- oder Reinigungsverfahren geschehen. Diese Me- 
thoden sind z. B. solche die LosUchkeiten beu^fifen, wie Salzfallungen, und Losungsmitielfallungen, Meihoden die sich 
die unterschiediichen Molekulargewichie zu Nutze machen, wie Dialyse, Ulu^afitotion, Gelfitotion, und SDS-Poly- 
acrylamidgeieiekurophorese, Methoden die sich die unterschiediichen Ladungen zu Nuize machen, lonenaustauschchro- 
matographie, Methoden die sich die unterschiediichen Hydrophobizitaten zu Nutze machen, wie reverse-phase HPLC 65 
(High Performance Uquid Chromatography), Methoden die sich bestinmite Affinitaten zu Nutze machen, wie Affinitat- 
schromalographie, und Methoden die sich Unterschiede im Isoelekirischen Punkt zu Nutze machen, wie isoelektrische 
Fokussierung. Es ist auch vors!eUbar, daB ZellexU^le, entweder aus dem Organismus, welchCT das (Jen des akzessori- 



9 



5D0CID: <DE 19840771Al_L> 



10 



15 



25 



30 



35 



40 



45 



50 



55 



60 



DE 198 40 771 Al 

schen Komplexes tragi, geniacbt werden konnen, welcher die erfindungsgemaBe Aufgabe erfuUt, oder aus dem rekom- 
binanten Wirtsorganismus, z. B. E. coli. Mil diesen Exiraklionen konnte man unler Umstanden andere Reiniguncs- 
schntie umgehen. ^ * 

Die oben beschriebenen Verfahren konnen in einer X^elzahl von Kombinationen eineesetzt werden um die Proteine 
des m vjtro-Komplexes bereitzustellen. 

per erfindungsgemaBe thermostabile prokar>'ontische akzessorische in vitro Komplexes kann zur Elongation von Nu- 
kleinsauren verwendet werden, z. B. zur Polymerase-Ketten-Reaktion, DNA Sequenzierung, zur Markierung von Nukle^ 
insauren und anderen Reaktionen, die die Nukleinsaure in vitro Synthese beinhalten. Auch mogUch ist die Anwendung in 
der reversen 'Jranskription, wobei entweder dec erfindungsgemaBe Komplex selbst reverse 'IVanskriptase Aktivitat be- 
sitzi, Oder aber zusStzlich ein geeigneles Enzym hinzugefugt wird, das reverse Transkripiase Aktivitat besitzt. 

Ein weiterer Gegenstand der vorliegenden Erfindung ist daher ein Verfahren zur Template-abhangigen Elongation von 
Nuklemsauren, wobei die Nukleinsaure notigenfalls denaturiert, mil mindestens einem Primer unter Hybridisierungsbe- 
dingungen versehen wird, wobei der Primer komplementar zu einem flankierenden Bereich einer gewunschten Nuklein- 
sauresequenz des Templatestrangs ist und mil Hilfe einer Polymerase in Anwesenheit von Nukleotiden eine Primerelon- 
gauon erfolgt, wobei als Polymerase ein erfindungsgemafier thermostabiler in vitro-Komplex eingesetzt wird. 

Verfahren zur Template-abhangigen Hongadon von Nukleinsauren, bei denen die Elongation ausgehend von einem 
Pnmer erfolgt, der an die Template-Nukleinsaure anhybridisiert wurde und ein freies 3'-0H-Ende fiir die Elongation zur 
Verfijgung steUt, sind dem Fachmann bekannt. Zur Amplifikation wird insbesondere eine Polymerasekeuenreaktion 
durchgefimn. Hierbei wird normalerweise von einer doppelstrangigen DNA-Sequenz ausgegangen, von welcher ein be- 
summter Zielbereich amphfiziert werden soU. Hierbei werden zwei Primer eingesetzt, welche zu die Zielsequenz flan- 
ki^enden Bereichen auf jeweils einem Teilstrang des DNA-Doppelstranges komplementar sind. Zur Anhybridisierung 
der Pnmer werden allerdings die DNA-Doppelstrange zuerst denaturiert, insbesondere thermisch aufgeschmolzen Nach 
Anhybndisierung der Primer erfolgt eine Elongation mittels der Polymerase, daraufhin wird nochmals denaturiert und 
damitdie neu gebildeten DNA-Strange von den Template-Strangen getrennt, woraufhin^r einen weiteren Elongations- 
zyklus neben den urspriinglichen Templatestrangen auch die ini ersten Schritt gebildeten Nukleinsaurestrange als Tem- 
plate zur Verfugung stehen, diese jeweils emeul iidt Priiiiem hybridisiert werden und ^e emeute Elongation slattfindet 
Di^c Vorgchcnswcisc wird zyklisch durchgcfuhrt unicr jeweils thcrmischcr Dcnaturicmhg als Zwischcnschrittc 

Zur erfindungsgemaB bevorzugten Reversen Transkription von RNA in DNA wird ebenfalls ein erfindungsgemaBer 
thermostabiler in vitro-Komplex eingesetzt, wobei dessen Elongationsprotein eine Reverse Transkriptase-Aktivita auf- 
weist. Diese Reverse Transkriptase-Aktivitat kann die einzige Polymeraseaktivitat des Elongationsproteins sein kann 
aber auch zusatzlich zu einer vorhandenen 5'-3'-DNA-Polymeraseaktivitat vorliegen. 

Ein wei teres bevorzugtes erfindungsgemaBes Verfahren betriffi die Sequenzierung von Nukleinsauren ausgehend von 
einem Primer, der zu einem der zu sequenzierenden Nukleinsaure benachbarten Bereich komplementar ist, wobei wie- 
derum eme Teniplate-abhangige Elongation oder aber bei Sequenzierung einer ^RNA eine Reverse TVanskription unter 
Verwendung von Deoxynukleodden und Dideoxynukleotiden gemaB der Methode von Sanger durchgefuhrt wird Als 
Deoxynukleotide oder Dideoxynukleotide werden im Rahmen dieser bevorzugten Ausfuhrungsform auch die oben be- 
schnebenen jeweiligen Derivate als geeignet angesehen. Insbesondere ist es fur die erfindungsgemaBen \ferfahren zur 
Elongatiorivon Nukleinsauren bevorzugt, daB die gebildeten Nukleinsauren markiert werden. Hierzu ist es mogUch 
markieite Primer oder/und maikierte Deoxynukleodde oder/und markierte Dideoxynukleodde oder/und markierte Ribo^ 
nukleotide oder jeweils entsprechende Derivate, wie sie oben bereits beispielsweise beschrieben sind, einzusetzen 

Wederum ein weiterer Gegenstand der vorliegenden Erfindung ist ein Verfahren zur Markierung von Nukleinsauren 
durch binfiigung einzelner Briiche in Phosphodiesterbindungen der Nukleinsaureketle und Ersatz eines Nukleotids an 
den BruchsteUen durch ein markiertes Nukleodd niit Hilfe einer Polymerase, wobei als Polymerase ein erfindungsgema- 
Ber thermostabiler in vitro-Komplex eingesetzt wird. 

Ein solches Verfahren, allgemein Nick-Transladon genannu ermoglicht eine einfache Markierung von Nukleinsauren 
Alie oben bereits beschriebenen mariderten Ribonukleodde oder Deoxyribonukleodde oder Derivate davon sind hierfiir 
geeignet, solange die Polymerase sie als Substrat akzepdert. 

Diese erfindungsgemaBe Mariderung kann z. B. auch im Rahmen oder anschlieBend an eine PCR-Reaktion erfolgen 
wofur die erfindungsgemaBe Verwendung eines thermostabilen in viuo-Komplexes besonders gunstig ist. 

Gegenstand der vorliegenden Erfindung ist ebenfaUs ein Kit zur Elongadon oder/und AmpUfikation oder/und Rever- 
sen Transkription oder/und Sequenzierung von Nukleinsauren, wobei dieser Kit in einem oder in mehreren Behaltem 
vorhegen kann, enthaltend 

a) einen erfindungsgemaBen diermostabilen in vitro Komplex oder 

b) einen thermostabilen, akzessorischen in vitro-Komplex und ggf. separat davon ein PolymeraseakUvitai aufwei- 
sendes Elongauonsprotein sowie ggf. Primer, Puffersubstanzen, Nukleotide, AIT, artdo-e Kofakforen oder/und Pv- 
rophosphat, 

Insbesondere ist Gegenstand der vorliegenden Erfindung ein Kit zur Elongation, Amplifikation, Reversen TVanskrip- 
tion, Markierung bzw. Sequenzierung von Nukleinsauren, wobei zusatzlich Deoxynukleotide bzw, deren Derivate ent- 
haltend sind 



Ein bevoizugter erfindungsgemaBer Reagenzienkit enthalt daher zur Amphfikation von Nukleinsauren neben den Sub- 
stanzen a) oder b), welche 5'-3'-Polymeraseaktivitat aufweisen, auch Deoxynukleotide oder/und Derivate davon Gege- 
benenfalls konnen hier auch Ribonukleotide oder Derivate davon eingesetzt werden, namhch dann, wenn eine Polyme- 
rase eingesetzt wird, welche auch Ribonukleotide als Subsu^t akzeptiert. 

Kn weiterer bevorzugter Gegenstand der vorliegenden Erfindung ist ein Kit zur Sequenzierung von Nukleinsauren, 
wobei zusatzlich zu Deoxynukleotide bzw. deren Derivaten, Dideoxynukleotide bzw. deren Derivate zur Ketlenlermi- 
naaon enthaltend sind. 
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Desweitercn ist insbesondere Gegenstand der vorliegenden Erfindung ein Kit zur reversen Thnskrition von Nuklein- 
sauren, wobei entweder der erfindungsgemaBe Komplex selbst reverse Transkriptase Aktivitat besitzt, oder aber zusatz- 
lich ein gecignetes Enzym anwesend ist, das reverse TVanskriptase Aktivitat besitzt, wobei Deoxynukleotide bzw deren 
Denvaten im Reaktionsgemisch enthalten sind. ' 

einer weiteren besonders bevorzugten Ausfuhrungsform enUiSlt der Kit Primer oder^nd Deoxynukleotide oder/und 
Dideoxynukleoiide oder/und Ribonukleotide oder^nd deren jeweiUge Derivate in markierter Form 

Insbesondere fiir die Sequenziening von Nukleinsauren ist es n5tig, eine Markierung einzufugen. Geeignete Markie- 
rungen and weiter oben bereits in beispielhafter Form beschiieben und gelten ebenso als bevorzugte Ausfiihrunesfor- 
raenimRahmender^rfiDdungsgemaBenReagenzienkits. uiuungsror 

Im Rahmen der vorUegenden Erfindung ist es weiterhin mOgUch, daB der Reagenzienkit zur Markierung von Nukle- 
insauren im Rahmen emer sogenannten Nick-Translation eingesctzt wini. In diescm Fall enthalt der Reagenzienkit die 
Bestandteile a) oder b) und markierte Nukleotide, wobei Puffersubslanzen. ATP oder andere Kofaktoren oder/und Pyrcv 
phosjihat ehenfalls anwesend sein konnen. Primer werden alleniings im Rahmen einer Nick-Translation in der Reeel 
nicnt oenotigL "^^6^' 

• ^^^f^^Jf" vorliegenden Erfindung ist desweiteren die Verwendung eines theimostabilen Gleitklammerproteins 
NuUdlSiS^ ^ Elongation, Amplifikation, Markierung bzw. Sequenziening oder Reversen TranskripUon von 

^ Gleitklammerprotein ein thermostabiles Homolog zu dem PCNA-Protein (SEO ID 
m^. 1^, 13. J4 15), dem PCNA-Homologen aus Archeaoglobus Fulgidus oder dem 6-cIamp-Protein (SEO ID NO- ^5 
36)voiliegt. Insbesondere ist ein Kit bevorzugt, der zusaizlich ein Kopplungsprotein enthalt 

m^ri/nrf "^v bf voizugu der zusatzlich ein weiteres Rotein enthaltend, wobei das weitere Protein als Klam- 
marlader zur Verfugung steht, auch aus der Gruppe der Prokaryonten stammt und thermostabil ist 
daSJ^r gedgneten Puffer enthalt, wie oben beschrieben. Bevorzugt ist ebenfalls 
daB der erfindungsgemaBe Kit zusatzhch eine Pyrophosphatase. ATP oder/und andere Kofaktoren emhalf 
Die folgenden Beispiele soUen in Veibindung mit den Abbildungen die Erfindung naher erlautem: 

Abbildungen 

der'SlSu^ntd^rlS^^^^ 

Abb. 1 

^teinsequetizen, paarweise Alignments und mulUple AUgnments aus Archaebakterien und den entsprechenden hu- 
manen Oenen des Rephkaiionsapparates. 

Wobei die Annotauon: ' bedeuiet %Identitat zu dem entsprechenden humanen Gen, berechnet aus dem paarweisen 
tenZTA^^^^T"'^ T/^^^.-^-^ [Feb-24-1998] und die Annotation ^ %Iden,itat zu dem entspxechenSn 
SiOQ«i ? "'^"'3'' paarweisen Alignment (siehe Anhang) mit BLASTP 2.0.4 [Feb- 

24-1998] und die Annotation ' %Identitat zu dem entsprechenden humanen Gen. berechnet aus dem paarweisen 
Alignment mit FASTA 3. 1 102 [March, 1998]* bedeutet. Die Verfahren werden naher beschrieben in- P^^^'**" 
Altschul, Stephen TTiomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David 
A fT ^..^^,^oI^,^P^ ^^^^^ PSI-BLAST: a new generation of prWein database Lich progranis^NucTdc 
t n ^^f . T."!" * ^P™'*" PNAS (1988) 85: 2444 2448. Die AbbilduSt im fS 

des Gleitklammerladex^ I, GleiUdammerlader D. Gldtklammer. koppelnde Untereinheit und Elongationsp^tein I die 
^rnTSfI'-?".^"'A " Datenbanken und ebenso deren SEQ ID Nummer. wobei die Werte in den Klammem jeweils Pk> 
zentldemtat je Anzahl Ammosauren darsteUen. Ira Falle des Hongalionsprotein H beziehen sich die Wferte auf Ptozent 
Sequenzidentitat zu der Archaeaoglobus fulgidus Sequenz. ^»"«:ni 



Abb. 2 



Hi/^nnr?^"^ K P'^'^Jlf Ahgnments und muluple Alignments aus Eubakterien des Replikationsapparates. Wobei 
die Annotauon ' bedeule. %Idenn.at zu dem entsprechenden Gen ausE. coli, berechne. aus dem paarweisen Alignme^ 
Setntn Tlp^^^^ "4^.' [Feb-24-1998]S. Das Verfahren wird in: Altschul. Stephen F^^omas L. 
S^^AKT ^^"S ^ ^'^^^'^ J- Lipinan (1997). "Gapped BLAST ^d 

JM-ULA6 i:.a new generaUon of protem database search programs". Nucleic Acids Res. 25: 3389-3402. naher beschrie- 



Abb.3 



heif^TasXSSS M^^^^^^ ^'"^^ '"^'''^^"""^^ "'"^ ^'"^ ^""'-•"''^ Unterein- 



Abb. 4 



Die Abb. 4 zeigt Alignments zweier konserviener Regionen des Gleitklammerproteins, sowie die daraus heigeleiteten 
Konsensussequenzen. Folgende Gene sind gezeigt: PCNA human (aus SEQ ID NO: 11), die entsprechende sSnz aus 
iS'ta^ii^ ;E0 ^^^^^^^^ "S-P' Methanococcus janashii (aus SEQ £> NO: 1?. aus h^" 

Koscmi (aus ShQ m NO. 14) und aus Methanococcus themtoauiothrophicus (aus SEQ ID NO: 15). 
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Abb, 5 

Die Abb. 5 zeigt ein Alignment einer konservierten Regionen der koppelnden Untereinheit, sowde die daraus heige- 
leiteten Konsensussequenzen, Folgende Gene sind gezeigt: PfuORK>, DPD2_HUMAN, AF1790 und MJ0702. Die SEQ 
5 ID Nummem konnen Abb. 1 entnommen werden. 

Abb, 6 

Die Abb. 6 zeigt ein Alignment einer konservierten Regionen der koppelnden Untereinheit, sowie die daraus heige- 
10 leiteten Konsensussequenzen. Folgende Gene sind gezeigt: AC11_HUMAN, AF2060, MTH0241, PHBN012 und 
MJ1422. Die SEQ ID Nummem konnen Abb. 1 entnommen werden. 

Abb. 7 

15 Die Abb. 7 zeigt ein Alignment einer konservierten Regionen des Gleitklammerlader 2, sowie die daraus heigeleiteten 
Konsensussequenzen. Folgende Gene sind gezeigt: AC15_HUMAN, MJ0884, AF1195, MTH0240 und MTH0240. Die 
SEQ ID Nummem konnen Abb. 1 entnommen werden. 

Abb. 8 

20 

Die Abb. 8 zeigt ein Alignment einer konservierten Regionen des Elonagationsproteins 1, sowie die daraus heigelei- 
teten Konsensussequenzen. Folgende Gene sind gezeigt: DPOD_HUMAN, MJ0885, MTH1208, PHBT047 und 
DPOL_ARCFU. Die SEQ ID Nummem konnen Abb. 1 entnommen werden. 

25 Abb. 9 

Die Abb. 9 /a.*!-;! cin Alignment cincr konservierten Regionen des ElongationsprotcinS^, sowie die daraus hcrgclcitc- 
ten Konsensusscqucn/cn. l olgende Gene sind gezeigt: AF1722, MJ1630, PfuORF3, MTH1536 und PHBN021. Die 
SEQ ID Nuninicrn konnen Abb. 1 entnommen werden. 

30 

Abb. 10 

Die Abb. 10 zcigi ein Alignment einer konservierten Regionen des Elongationsproteins aus Eubakterien, sowie die 
daraus hergelcitcicii Konscniiussequenzen. Folgende Gene sind gezeigt: DP3A_EC0LI: DNA Pol HI, alpha subunit, 
35 Escherichia coli, BB0579: DNA Pol EI, alpha subunit, BorreUa burgdorferi, DP3A_HELPY: DNA Pol HI, alpha subunit, 
Helicobacter pylori AA50: Aquifex aeolicus, section 50 und DP3A_SALTY: DNA Pol IE, alpha subunit, SahnoneUa 
typhimurium). 

Abb. 11 

40 

I>ie Abb. 11 zeigt ein Alignment einer konservierten Regionen der Gleitklammer aus Eubakterien, sowie die daraus 
hergeleiteten Konsensussequenzen. Folgende Gene sind gezeigt: AAPOL3B, DP3B_EC0LI, S TYPHIM 
DP3B_PROMI, DP3B_PSEPU und DP3B_STRC0 (AAPOL3B: Aqufex Aeolicus sektion 93: DP3B_ECOLI: DNA Pol 
m, beta chain, Escherichia coli S.TYPHIM: DNA Pol HI, beta chain, Sahnonella typhimurium P3B_PR0M[: DNA Pol 
45 in, beta chain, Probeus mirabiUs DP3B_PSEPU: DNA Pol m, beta chain, Pseudomonas putida DP3B_STRCO: DNA 
Pol in, beta chain, Strptomyces coelicolor). 

Abb. 12 

50 Multiples Alignment der Gleitklammer-proteinsequenzen zur Generierung des Hidden Markov Models. 

Abb. 13 

Multiples Alignment der eubakteriellen Gleitklammerproteinsequenzen zur Generierung des Hidden Markov Models 
55 (AAP0L3B: Aqufex Aeolicus sektion 93: DP3B_ECX)LI: DNA Pol m, beta chain, Escherichia coli S.TYPHIM: DNA 
Pol UI, beta chain. Salmonella typhimurium P3B_PR0M1: DNA Pol lU, beta chain, Probeus mirabiTis i)P3B_PSEPU: 
DNA Pol m, beta chain, Pseudomonas putida DP3B_STRCO: DNA Pol m, beta chain, Suptomyces coelicolor). 

Abb. 14 

60 

Multiples Alignment der Gleitklammeriader 1 Proteinsequenzen zur Generierung des Hidden Markov Models. 

Abb. 15 

65 Multiples Alignment der Gleitklammerlader 2 Proteinsequenzen zur Generierung des Hidden Markov Models. 
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Abb. 16 

Multiples Alignment der Proteinsequenzen der koppelnden Untereinhciten zur Generiening des Hidden Markov Mo- 
dels. 

Abb. 17 

Multiples AUgnment der Sequenzen der Elongationsproteine 1 zur Generiening des Hidden Markov Models. 

Abb. 18 

Multiples Alignment der Sequenzen der Elongationsproteine 2 zur Generiening des Hidden Markov Models. 

Abb. 19 

Multiples Alignment der Sequenzen der eubakterieUen Elongationsproteine zur Generiening des Hidden Markov Mo- 
dels. Foigende Gene sind gezeigt* 

A-^i^; ^^^r ^""^ ™' ^P*"^ Escherichia coli, BB0579: DNA Pol IE, alpha subunit, Borrelia burgdorferi, 

nP^^S^nM^^ofm^V^^^*^^^ '"^V"^"- HeUcobacter pylori AA50: Aquifcx aeolicus, section 50 und 
DP3A_SALTY: DNA Pol HI, alpha subunit, SalmoneUa typhimurium). 

Beispiel 

Die DNA wird mittels der bekannten Verfahren aus dem Organismus Archaeoglobus folgidus (DSM No 4304) eerei- 
mgt. Orgamsmen Aufzucht geschieht hierbei durch die DSM (Deutsche Sammlung ffir Mikroorganismen) Um die ent- 
sprechenden Gene (Gleilklainmerlader I/O, Gkitklaiiuner, Elongaiionsproleine MI, k-^^pelnde Unlereinheit) in den Ex- 
prcssionsvcktor pTrc99 zu klonicrcn, wcrdcn fur jcdcs Gen Primer cntwickclt, die den voUstandigcn offcncn Lcscnrah- 
men umspannen, sowie Start- wie Stopcodon. Hierbei werden den Primera jeweils Resuiktionsenden hinzugefiigt, die 

u^^'^fi'/f^ Kloniening m den Expressionsvektor erleichtem. Unter Verwendung von etwa 200 ng Gesamteenom- 
ischer DNA werden mit den entsprechenden Annealing-Temperaturen PGR Reaktionen gefahren (etwa 35 Zyklen) und 
die daraus resuluerenden Produkte gereinigt Nacb der Reinigung werden die Produkte mit Restriktionsenzymen behan- 
delt und uber em Agarosegel gereinigt um zur Ligation bereit zu stehen. Der Expressionsvektor wird miuels Restrikti- 
onse-nzymen so hneansiert, gereinigt und verdunnt, daB er zur Ligation mit den Amplifikaten der akzessorischen Gene 
aus der obigen PGR bereit ist. Die Ligauon wird angesetzt und nach Inkubation ein Aliquot in den E. coli Stamm IN^^o 
hah anvtrogen) transfonniert. Von jedem Gen werden 3 positive Kolonien gepickt, Plasmid DNA prapariert und die In- 
serts auf VoUstandigkeii lind Richtigkeit mittels DNA Sequenzienjng iiberpriift. Korrekte Klone werden ausgesucht und 
erneui zur Ver-emzelungauf Agarplaiten (ampicillin) ausgebracht. Kolonien werden gepickt und Cbemachdculturen an- 
gcseizt. Em Abquot (500 pi) der Ubemachtkultur wird in eine ein bis fiinf Liter Kultur von LB (Ampicillin- gO me/l) ee- 
geben. Die Kulturen wachsen bis zu einer ODeeo von 0.8 bei 37T nun wird zur Induktion IPTG zugegeben (125 me^) 
Oiese Kulturen wachsen nun weitere 1 1 Stunden. Die Kulturen werden zentrifugiert und die PeUeu in einem Puffer auf- 
genommen (Puffer A: 50 mM Tris-HCl pH 7.9, 50 mM Dextrose, 1 mM EDTA). Nach Zentrifugation werden die ZeUen 
ernem autgenommen Jedoch enthalt Puffer A nun zusatzlich Lysozym (4 mg/ml). Nach Inkubation (15 min.) wind ein 
gleiches Voiumen Pufe (B: 10 mM Tris-HCl pH 7.9, 50 mM KQ, 1 mM EDTA, 1 mM PMSF, 0.5% 

Tween 20, 0.5% Nonidet P40) und die Lyse zur Inkubation bei 75°C fiir eine Stunde gegeben, Nach Zentrif\igation wird 
der Uberstand entnommen und die uberexprimierten Proteine werden mittels (miOiSO^ ausgefallt. Die PeDets werden 
nach Zeninfugaiion gesamraelt und die Proteine mit PuflFer A resuspendiert. Die resuspendierten Proteine werden eeeen 
Aun>ewahningspufiFer (50 mM Tris-HCl pH 7.9, 50 mMKQ, 0.1 mMEDTA, 1 mM DTP, 0.5 mM PMSF, 50% Glycerol) 
dialy siert und anschlieBend bei -70X gelagert. . ^^KJiy^^s^i 

Um die Aklivitat der Proteine zu testen werden Reaktionen wie folgt zusammengeseizi: Aliquote der Proteine werden 
m unterschiedhchen Konfigurationen und Molaritaten vereint, Gleitklammerlader m mit Gleitklammei; koppelnder Un- 
tereinheit und Elongationsprotein I, oder Gleitklammerlader im mit Gleitklammer, mit und ohne koppelnder Unterein- 
heit und Elongationsprotein H, oder Gleitklammer, und Elongationsprotein I oder H und schlieBlich nur Elongationspro 
tern I Oder D; als Rjffer dient der obige Aufbewahningspuffer. DNA PolymerisationsakUvitat wird mittels Inkorporation 
von (Methyl- H) TTP m Tnchlorsaure-unlosliches Material geraessen Gshino, Y, Iwasaki, H., Fukui, H., Mineno J 
Kato, I., & Schimgawa, H. (1992) Biochimie 74, 131-136). Um die Prozessivitat zu bestimmen werden die obigen Pro^ 
leingemische m Pnmer-Elongationsexperimenten verwendet. Ein M13 Einzelstrangtemplate wird in 10 niM 'Ihs-HCl 
(pH 9.4) eingebracht und gemeinsam mit einem Universal-fwimer (5-Fnx: markiert) erhitzt (92°C) und abgekiihlt 
(Raumtemperatur). Verdiinnungsserien des so generierten Template-Primer Gemisches werden in einer Reaktion besle- 
hend aus Nukl^tiden (etwa 200 pM bis 1 mM), Reaktionspuffer (Endkonzentration: 50 mM KQ, 10 mM Tris-HQ (pH 

. K iw^c^fSL'o^I! "^^--^ "'^ ""^ proteinst^hiJisierenden Agenzien zusammengef^ihrt und fur lOMi- 
nuten be, 37 C, 52^ 62^ 68^ 74^, und 78^ inkubiert. Ein Aliquot wird zur Analyse auf einen Sequenzautomaten geladen 
(z. B. Alf, Phannacie Biotech). Dieobigen Proteingemischedienen auch dazu Fidelity und Exonukleaseaktivitat zu mes- 
TsdS^^sso /fo7^^^^ t ^Z' 7958-7962 (1991) oder Chase et al. (J. BioL Chem., 249: 

434^^^:) (1972) beschnebenen Verfahren zur Anwendung kommen. Ebenso werden die Proteingemische in der PGR 
eingesetzt (Methods in Molecular Biology Nfol. 15 Humana Press Totowa, New Jersey, 1993, edited by Bruce A. White). 
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SEQUENZPROTOKOLL 

(1) ALLGEMEINE ANGABEN: 

(i) ANMELDER: 

(A) NAME: LION bioscience AG 

(B) STRASSE: Im Neuenheimer Feld 517 

(C) ORT: Heidelberg 

(E) LAND: DE 

(F) POSTLEIJZAHL: 69120 

{ii) BEZEICHNUNG DER ERFINDUNG: Akzessorische Komplexe mit 
Polymeraseaktivltaet 

(iii) ANZAHL DER SEQUENZEN: 38 

(iv) COMPUTER-LESBARE FASSUNG: 

(A) DATENTRAGER: Floppy disk 

(B) COMPUTER: IBM PC compatible ^ 

(C) BETRIEBSSYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version ^.30 (EPA) 



(2) ANGABEN ZU SEQ ID NO: 1 : 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 340 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: beides 

(ii) ART DES MOLEKULS: Protein 

(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Homo sapiens 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 1: 

Met GIu Thr Ser Ala Leu Lys Gin Gin Glu Gin Pro Ala Ala Thr Lys 
15 10 15 

He Arg Asn Leu Pro Trp Val Glu Lys Tyr Arg Pro Gin Thr Leu Asn 
20 25 30 
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Asp Leu lie Ser His Gin Asp lie Leu Ser Thr lie Gin Lys Phe lie 
35 40 45 

Asn Glu Asp Arg Leu Pro His Leu Leu Leu Tyr Gly Pro Pro Gly Thr 
50 55 60 

Gly Lys Thr Ser Thr lie Leu Ala Cys Ala Lys Gin Leu Tyr Lys Asp 
65 70 75 80 

Lys Glu Phe Gly Ser Met Val Leu Glu Leu Asn Ala Ser Asp Asp Arg 
85 90 95 . 

Gly lie Asp lie lie Arg Gly Pro lie Leu Ser Phe Ala Ser Thr Arg 
100 105 110 

Thr lie Phe Lys Lys Gly Phe Lys Leu Val lie Leu Asp Glu Ala Asp 
115 120 125 

Ala Met Thr Gin Asp Ala Gin Asn Ala Leu Arg Arg Val Ik Glu Lys 
130 135 140 

Phe Thr Glu Asn Thr Arg Phe Cys Leu lie Cys Asn Tyr Leu Ser Lys 
145 150 155 160 

lie lie Pro Ala Leu Gin Ser Arg Cys Thr Arg Phe Arg Phe Gly Pro 
165 170 175 

Leu Thr Pro Glu Leu Met Val Pro Arg Leu Glu His Val Val Glu Glu 
180 185 190 

Glu Lys Val Asp lie Ser Glu Asp Gly Met Lys Ala Leu Val Thr Leu 
195 200 205 

Ser Ser Gly Asp Met Arg Arg Ala Leu Asn lie Leu Gin Ser Thr Asn 
210 215 220 

Met Ala Phe Gly Lys Val Thr Glu Glu Thr Val Tyr Thr Cys Thr Gly 
225 230 235 240 

His Pro Leu Lys Ser Asp lie Ala Asn lie Leu Asp Trp Met Leu Asn 
245 250 255 

Gin Asp Phe Thr Thr Ala Tyr Arg Asn lie Thr Glu Leu Lys Thr Leu 
260 265 270 

Lys Gly Leu Ala Leu His Asp lie Leu Thr Glu lie His Leu Phe Val 
275 280 285 
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His Arg Val Asp Phe Pro Ser Ser Val Arg lie His Leu Leu Thr Lys 
290 295 300 

Met Ala Asp lie Glu Tyr Arg Leu Ser Val Gly Thr Asn Glu Lys lie 
305 310 315 320 

Gin Leu Ser Ser Leu lie Ala Ala Phe Gin Val Thr Arg Asp Leu lie 
325 330 335 



Val Ala Glu Ala 

IS 340 

(2) ANGABEN ZU SEQ ID NO: 2: 

20 (i) SEQUENZKENN2EICHEN: 

lA) LANGE: 319 Aminosauren 
(B) ART: Aminosaure 

IC) STRANGFORM: Einzelstrang 

ID) TOPOLOGIE: linear 

lii) ART DES MOLEKOLS: Protein 

30 

(VI) URSPRUNLICHE HERKUNFT: 

(A) ORGANISMUS: Archaeoglobus fulgidus 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 2: 

Met Glu Asn Phe Glu He Trp Val Glu Lys Tyr Arg Pro Arg Thr Leu 
1 5 10 15 

Asp Glu Val Val Gly Gin Asp Glu Val lie Gin Arg Leu Lys Gly Tyr 
^5 20 25 30 

Val Glu Arg Lys Asn lie Pro His Leu Leu Phe Ser Gly Pro Pro Gly 
35 40 45 

50 

Thr Gly Lys Thr Ala Thr Ala lie Ala Leu Ala Arg Asp Leu Phe Gly 
50 55 60 

Glu Asn Trp Arg Asp Asn Phe lie Glu Met Asn Ala Ser Asp Glu Arg 
65 70 75 80 

^ Gly lie Asp Val Val Arg His Lys lie Lys Glu Phe Ala Arg Thr Ala 
85 90 95 



65 
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Pro lie Gly Gly Ala Pro Phe Lys lie lie Phe Leu Asp Glu Ala Asp 
100 105 110 

Ala Leu Thr Ala Asp Ala Gin Ala Ala Leu Arg Arg Thr Met Glu Met 
115 120 125 

Tyr Ser Lys Ser Cys Arg Phe lie Leu Ser Cys Asn Tyr Val Ser Arg 
130 135 140 

lie ile Glu Pro He Gin Ser Arg Cys Ala Val Phe Arg Phe Lys Pro 
145 150 155 160 

Val Pro Lys Glu Ala Met Lys Lys Arg Leu Leu Glu ile Cys Glu Lys 
165 170 175 

Glu Gly Val Lys lie Thr Glu Asp Gly Leu Glu Ala Leu Ile Tyr lie 
180 185 190 

Ser Gly Gly Asp Phe Arg Lys Ala lie Asn Ala Leu Gin Gly Ala Ala 25 
195 200 205 

Ala Ile Gly Glu Val Val Asp Ala Asp Thr Ile Tyr Gin lie Thr Ala 

210 215 220 30 

Thr Ala Arg Pro Glu Glu Met Thr Glu Leu lie Gin Thr Ala Leu Lys 
225 230 235 240 



lU 



15 



20 



35 



40 



45 



50 



Gly Asn Phe Met Glu Ala Arg Glu Leu Leu Asp Arg Leu Met Val Glu 
245 250 255 

Tyr Gly Met Ser Gly Glu Asp Ile Val Ala Gin Leu Phe Arg Glu lie 
260 265 270 

Ile Ser Met Pro Ile Lys Asp Ser Leu Lys Val Gin Leu Ile Asp Lys 
275 280 285 

Leu Gly Glu Val Asp Phe Arg Leu Thr Glu Gly Ala Asn Glu Arg lie 
290 295 300 

Gin Leu Asp Ala Tyr Leu Ala -Tyr Leu Ser Thr Leu Ala Lys Lys 
305 310 315 

55 

(2) ANGABEN ZU SEQ ID NO: 3: 

(i) SEQUENZKENNZEICHEN: 

I A) LANGE: 1847 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

65 
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(D) TOPOLOGIE: linear 

^ (ii) ART DES MOLEKULS: Protein 

(vi) URSPRONLICHE HERKUNFT:. 

(A) ORGANISMUS: Methanococcus jannaschii 



,5 (xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 3: 

Met Val lie lie Met Glu Lys Pro Trp Val Glu Lys Tyr Arg Pro Lys 
1 5 10 15 

20 

Thr Leu Asp Asp lie Val Gly Gin Asp Glu lie Val Lys Arg Leu Lys 
20 25 30 

25 Lys Tyr Val Glu Lys Lys Ser Met Pro His Leu Leu Phe Ser -Gly Pro 
35 40 45 V 

Pro Gly Val Gly Lys Cys Leu Thr Gly Asp Thr Lys Val lie Val Asn 
^ 50 55 60 

Gly Glu lie Arg Glu lie Gly Glu Val lie Glu Glu lie Ser Asn Gly 
65 70 75 80 

Lys Phe Gly Val Thr Leu Thr Asn Asn Leu Lys Val Leu Gly lie Asp 
85 90 95 

40 

Glu Asp Gly Lys lie Arg Glu Phe Asp Val Gin Tyr Val Tyr Lys Asp 
100 105 110 

45 Lys Thr Asn Thr Leu lie Lys lie Lys Thr Lys Met Gly Arg Glu Leu 
115 120 125 

Lys Val Thr Thr Tyr His Pro Leu Leu lie Asn His Lys Asn Gly Glu 
50 130 135 140 

lie Lys Trp Glu Lys Ala Glu Asn Leu Lys Val Gly Asp Lys Leu Ala 
145 150 155 160 

55 

Thr Pro Arg Tyr lie Leu Phe Asn Glu Ser Asp Tyr Asn Glu Glu Leu 
165 170 175 

^ Ala Glu Trp Leu Gly Tyr Phe He Gly Asp Gly His Ala Asp Lys Glu 
180 185 190 



65 
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Ser Asn Lys He Thr Phe Thr Asn Gly Asp Glu Lys Leu Arg Lys Arg 
1 95 200 205 

Phe Ala Glu Leu Thr Glu Lys Leu Phe Lys Asp Ala Lys lie Lys Glu 
210 215 220 

Arg lie His Lys Asp Arg Thr Pro Asp He Tyr Val Asn Ser Lys Glu 
225 230 235 240 

Ala Val Glu Phe lie Asp Lys Leu Gly Leu Arg Gly Lys Lys Ala Asp 
245 250 255 

Lys Val Arg lie Pro Lys Glu He Met Arg Ser Asp Ala Leu Arg Ala 
260 265 270 

Phe Leu Arg Ala Tyr Phe Asp Cys Asp Gly Gly He Glu Lys His Ser 
275 280 285 

He Val Leu Ser Thr Ala Ser Lys Glu Met Ala Glu Asp Leu Val Tyr 
290 295 300 

Ala Leu Leu Arg Phe Gly lie He Ala Lys Leu Arg Glu Lys Val Asn 
305 310 315 320 

Lys Asn Asn Asn Lys Val Tyr Tyr His He Val He Ser Asn Ser Ser 
325 330 335 

Asn Leu Arg Thr Phe Leu Asp Asn He Gly Phe Ser Gin Glu Arg Lys 
340 345 350 

Leu Lys Lys Leu Leu Glu lie He Lys Asp Glu Asn Pro Asn Leu Asp 
355 360 365 

Val lie Thr He Asp Lys Glu Lys He Arg Tyr He Arg Asp Arg Leu 
370 375 380 

Lys Val Lys Leu Thr Arg Asp lie Glu Lys Asp Asn Trp Ser Tyr Asn 
385 390 395 400 

Lys Cys Arg Lys He Thr Gin Glu Leu Leu Lys Glu He Tyr Tyr Arg 
405 410 415 

Leu Glu Glu Leu Lys Glu He Glu Lys Ala Leu Glu Glu Asn He Leu 
420 425 430 

lie Asp Trp Asp Glu Val Ala Glu Arg Arg Lys Glu lie Ala Glu Lys 
435 440 445 
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Thr Gly lie Arg Ser Asp Arg lie Leu Glu Tyr lie Arg Gly Lys Arg 
450 455 460 

^ Lys Pro Ser Leu Lys Asn Tyr lie Lys He Ala Asn Thr Leu Gly Lys 
465 470 475 480 

j„ Asn lie Glu Lys lie lie Asp Ala Met Arg lie Phe Ala Lys Lys Tyr 
485 490 495 

Ser Ser Tyr Ala Glu lie Gly Lys Met Leu Asn Met Trp Asn Ser Ser 
15 500 505 510 

lie Lys lie Tyr Leu Glu Ser Asn Thr Gin Glu lie Glu Lys Leu Glu 
515 520 525 

20 

Glu lie Arg Lys Thr Glu Leu Lys Leu Val Lys Glu lie Leu Asn Asp 
530 535 540 

25 Glu Lys Leu lie Asp Ser lie Gly Tyr Val Leu Phe Leu Ala S<r Asn 
545 550 555 560 

^ Glu lie Tyr Trp Asp Glu He Val Glu lie Glu Gin Leu Asn Gly Glu 
565 570 575 

Phe Thr lie Tyr Asp Leu His Val Pro Arg Tyr His Asn Phe lie Gly 

35 580 585 590 

Gly Asn Leu Pro Thr lie Leu His Asn Thr Thr Ala Ala Leu Cys Leu 
595 600 605 

40 

Ala Arg Asp Leu Phe Gly Glu Asn Trp Arg Asp Asn Phe Leu Glu Leu 
610 615 620 

45 Asn Ala Ser Val Ser Lys Asp Thr Pro lie Leu Val Lys lie Asp Gly 
625 630 635 640 

Lys Val Lys Arg Thr Thr Phe Glu Glu Leu Asp Lys lie Tyr Phe Glu 
^ 645 650 655 

Thr Asn Asp Glu Asn Glu Met Tyr Lys Lys Val Asp Asn Leu Glu Val 
650 665 670 

55 

Leu Thr Val Asp Glu Asn Phe Arg Val Arg Trp Arg Lys Val Ser Thr 
675 680 585 

60 

lie lie Arg His Lys Val Asp Lys lie Leu Arg lie Lys Phe Glu Gly 
690 695 700 
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Gly Tyr He Glu Leu Thr Gly Asn His Ser He Met Met Leu Asp Glu 
705 710 715 720 

Asn Gly Leu Val Ala Lys Lys Ala Ser Asp lie Lys Val Gly Asp Cys 
725 730 735 



Phe Leu Ser Phe Val Ala Asn lie Glu Gly Glu Lys Asp Arg Leu Asp 
740 745 750 



10 



Leu Lys Glu Phe Glu Pro Lys Asp lie Thr Ser Arg Val Lys lie He 
755 760 765 



15 



Asn Asp Phe Asp He Asp Glu Asp Thr Ala Trp Met Leu Gly Leu Tyr 
770 775 780 

Val Ala Glu Gly Ala Val Gly Phe Lys Gly Lys Thr Ser Gly Gin Val 
785 790 795 800 



20 



He Tyr Thr Leu Gly Ser His Glu His Asp Leu ile Asn Lys Le,u Asn 
805 810 815 



25 



Asp lie Val Asp Lys Lys Gly Phe Ser Lys Tyr Glu Asn Phe Thr Gly 
820 825 830 



30 



Ser Gly Phe Asp Arg Lys Arg Leu Ser Ala Lys Gin Ile Arg lie Leu 
835 ^ 840 845 

Asn Thr Gin Leu Ala Arg Phe Val Glu Glu Asn Phe Tyr Asp Gly Asn 
850 855 860 

Gly Arg Arg Ala Arg Asn Lys Arg lie Pro Asp Ile lie Phe Glu Leu 
865 870 875 880 



35 



40 



Lys Glu Asn Leu Arg Val Glu Phe Leu Lys Gly Leu Ala Asp Gly Asp 
885 890 895 



45 



Ser Ser Gly Asn Trp Arg Glu Val Val Arg Ile Ser Ser Lys Ser Asp 
900 905 910 



50 



Asn Leu Leu Ile Asp Thr Val Trp Leu Ala Arg Ile Ser Gly lie Glu 
915 920 925 

Ser Ser Ile Phe Glu Asn Glu Ala Arg Leu Ile Trp Lys Gly Gly Met 
930 935 940 



55 



Lys Trp Lys Lys Ser Asn Leu Leu Pro Ala Glu Pro lie Ile Lys Met 
945 950 955 960 



60 



65 



21 



SOCXID: <DE. 



.1 9640771 A1J_> 



DE 198 40 771 A 1 

lie Lys Lys Leu Glu Asn Lys lie Asn Gly Asn Trp Arg Tyr lie Leu 
965 970 975 

Arg His Gin Leu Tyr Glu Gly Lys Lys Arg Val Ser Lys Asp Lys lie 
980 985 990 

Lys Gin lie Leu Glu Met Val Asn Val Glu Lys Leu Ser Asp Lys Glu 
995 1000 1005 

Lys Glu Val Tyr Asp Leu Leu Lys Lys Leu Ser Lys Thr Glu Leu Tyr 
1010 1015 1020 

Ala Leu Val Val Lys Glu lie Glu He lie Asp Tyr Asn Asp Phe Val 
1025 1030 1035 1040 

Tyr Asp Val Ser Val Pro Asn Asn Glu Met Phe Phe Ala Gly Asn Val 
1045 1050 1055 

Pro lie Leu Leu Hts Asn Ser Asp Glu Arg Gly lie Asp Val \[e Arg 
1060 1065 1070 

Thr Lys Val Lys Asp Phe Ala Arg Thr Lys Pro lie Gly Asp Val Pro 
1075 1080 1085 

Phe Lys He lie Phe Leu Asp Glu Ser Asp Ala Leu Thr Ala Asp Ala 
1090 1095 1100 

Gin Asn Ala Leu Arg Arg Thr Met Glu Lys Tyr Ser Asp Val Cys Arg 
1105 1110 1115 1120 

Phe lie Leu Ser Cys Leu Thr Gly Asp Ala Lys lie Thr Leu Pro Asp 
1125 1130 1135 

Glu Arg Glu lie Lys lie Glu Asp Phe lie Lys Met Phe Glu Glu Arg 
1140 1145 1150 

Lys Leu Lys His Val Leu Asn Arg Asn Gly Glu Asp Leu Val Leu Ala 
1155 1160 1165 

Gly Val Lys Phe Asn Ser Lys lie Val Asn His Lys Val Tyr Arg Leu 
1170 1175 1180 

Val Leu Glu Ser Gly Arg Glu He Glu Ala Thr Gly Asp His Lys Phe 
1185 1190 1195 1200 

Leu Thr Arg Asp Gly Trp Lys Glu Val Tyr Glu Leu Lys Glu Asp Asp 
1205 1210 1215 
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Glu Val Leu Val Tyr Pro Ala Leu Glu Gly Val Gly Phe Glu Val Asp 
1220 1225 1230 

Glu Arg Arg lie lie Gly Leu Asn Glu Phe Tyr Glu Phe Leu Thr Asn 
1235 1240 1245 

Tyr Glu lie Lys Leu Gly Tyr Lys Pro Leu Gly Lys Ala Lys Ser Tyr 
1250 1255 1260 

Lys Glu Leu lie Thr Arg Asp Lys Glu Lys lie Leu Ser Arg Val Leu 
1265 1270 1275 1280 

Glu Leu Ser Asp Lys Tyr Ser Lys Ser Glu He Arg Arg Lys lie Glu 
1285 1290 1295 

Glu Glu Phe Gly lie Lys He Ser Leu Thr Thr He Lys Asn Leu lie 
1300 1305 1310 

Asn Gly Lys lie Asp Gly Phe Ala Leu Lys Tyr Val Arg Ly< lie Lys 
1315 1320 1325 

Glu Leu Gly Trp Asp Glu lie Thr Tyr Asp Asp Glu Lys Ala Gly lie 
1330 1335 1340 

Phe Ala Arg Leu Leu Gly Phe lie lie Gly Asp Gly His Leu Ser Lys 
1345 1350 1355 1360 

Ser Lys Glu Gly Arg lie Leu lie Thr Ala Thr lie Asn Glu Leu Glu 
1365 1370 1375 

Gly lie Lys Lys Asp Leu Glu Lys Leu Gly He Lys Ala Ser Asn lie 
1380 1385 1390 

He Glu Lys Asp lie Glu His Lys Leu Asp Gly Arg Glu lie Lys Gly 
1395 1400 1405 

Lys Thr Ser Phe lie Tyr lie Asn Asn Lys Ala Phe Tyr Leu Leu Leu 
1410 1415 1420 

Asn Phe Trp Gly Val Glu lie Gly Asn Lys Thr He Asn Gly Tyr Asn 
1425 1430 1435 1440 

He Pro Lys Trp lie Lys Tyr Gly Asn Lys Phe Val Lys Arg Glu Phe 
1445 1450 1455 

Leu Arg Gly Leu Phe Gly Ala Asp Gly Thr Lys Pro Tyr lie Lys Lys 
1460 1465 1470 
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Tyr Asn lie Asn Gly lie Lys Leu Gly lie Arg Val Glu Asn lie Ser 
1475 1480 1485 

Lys Asp Lys Thr Leu Glu Phe Phe Glu Glu Vat Lys Lys Met Leu Glu 
1490 1495 1500 



10 



Glu Phe Glu Val Glu Ser Tyr lie Lys Val Ser Lys lie Asp Asn Lys 
1505 1510 1515 1520 



15 



Asn Leu Thr Glu Leu lie Val Lys Ala Asn Asn Lys Asn Tyr Leu Lys 
1525 1530 1535 



20 



Tyr Leu Ser Arg lie Ser Tyr Ala Tyr Glu Lys Asp Asn Phe Ala Arg 
1540 1545 1550 

Leu Val Gly Glu Tyr Leu Arg lie Lys Glu Ala Tyr Lys Asp lie lie 
1555 1560 1565 



2S 



Leu Lys Glu lie Ala Glu Asn Ala Leu Lys Glu Ala Asp Gly Glu Lys 
1570 1575 1580 



30 



Ser Leu Arg Glu Leu Ala Arg Lys Tyr Asn Val Pro Val Asp Phe He 
1585 1590 1595 1600 



35 



lie Asn Gin Leu Lys Gly Lys Asp lie Gly Leu Pro Arg Asn Phe Met 
1605 1610 1615 



Thr Phe Glu Glu Phe Leu Lys Glu Lys Val Val Asp Gly Lys Tyr Val 
1 620 1 625 1 630 

40 

Ser Glu Arg lie lie Lys Lys Glu Cys lie Gly Tyr Arg Asp Val Tyr 
1635 1640 1645 

45 Asp lie Thr Cys His Lys Asp Pro Ser Phe lie Ala Asn Gly Phe Val 
1650 1655 1660 

Ser His Asn Cys Asn Tyr Pro Ser Lys lie lie Pro Pro lie Gin Ser 
* 1665 1670 1675 1680 



55 



60 



Arg Cys Ala Val Phe Arg Phe Ser Pro Leu Lys Lys Glu Asp lie Ala 
1685 1690 1695 

Lys Lys Leu Lys Glu lie Ala Glu Lys Glu Gly Leu Asn Leu Thr Glu 
1700 1705 1710 

Ser Gly Leu Glu Ala lie He Tyr Val Ser Glu Gly Asp Met Arg Lys 
1715 1720 1725 



65 
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Ala lie Asn Val Leu Gin Thr Ala Ala Ala Leu Ser Asp Val lie Asp 
1730 1735 1740 

Asp Glu lie Val Tyr Lys Val Ser Ser Arg Ala Arg Pro Glu Glu Val 
1745 1750 1755 1760 

Lys Lys Met Met Glu Leu Ala Leu Asp Gly Lys Phe Met Glu Ala Arg 
1765 1770 1775 

Asp Leu Leu Tyr Lys Leu Met Val Glu Trp Gly Met Ser Gly Glu Asp 
1780 1785 1790 

lie Leu Asn Gin Met Phe Arg Glu lie Asn Ser Leu Asp lie Asp Glu 
1795 1800 1805 

Arg Lys Lys Val Glu Leu Ala Asp Ala lie Gly Glu Thr Asp Phe Arg 
1810 1815 1820 

He Val Glu Gly Ala Asn Glu Arg lie Gin Leu Ser Ala Leg Leu Ala 
1825 1830 1835 1840 

Lys Met Ala Leu Met Gly Arg 
1845 

(2) ANGABEN ZU SEQ ID NO: 4: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 855 Aminosauren 
{B) ART: Aminosaure 
(C) STRANGFORM: Einzelstrang 
ID) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 

(vi) URSPRUNLICHE HERKUNFT: 

(A) ORGANISMUS: Pyrococcus horikoshli 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 4: 

Met His Asn Met Glu Glu Val Arg Glu Val Lys Val Leu Glu Lys Pro 
1 5 10 15 

Trp Val Glu Lys Tyr Arg Pro Gin Arg Leu Asp Glu lie Val Gly Gin 
20 25 30 
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Glu His lie Val Lys Arg Leu Lys His Tyr Val Lys Thr Gly Ser Met 
35 40 45 

^ Pro His Leu Leu Phe Ala Gly Pro Pro Gly Val Gly Lys Cys Leu Thr 
50 55 60 

10 Gly Asp Thr Lys Val lie Ala Asn Gly Gin Leu Phe Glu Leu Arg Glu 
65 70 75 80 

Leu Val Glu Lys lie Ser Gly Gly Lys Phe Gly Pro Thr Pro Val Lys 
15 85 90 95 

Gly Leu Lys Val lie Gly lie Asp Glu Asp Gly Lys Leu Arg Glu Phe 
100 105 110 



20 



Glu Val Gin Tyr Val Tyr Lys Asp Lys Thr Glu Arg Leu lie Arg lie 
115 120 125 



25 



Arg Thr Arg Leu Gly Arg Glu Leu Lys Val Thr Pro Tyr His ^ro Leu 
130 135 140 V 



3^ Leu Val Asn Arg Arg Asn Gly Glu lie Lys Trp Val Lys Ala Glu Glu 
145 150 155 160 

Leu Lys Pro Gly Asp Lys Leu Ala Val Pro Arg Phe Leu Pro lie Val 
35 165 170 175 

Thr Gly Glu Asp Pro Leu Ala Glu Trp Leu Gly Tyr Phe Leu Gly Gly 
180 185 190 

40 

Gly Tyr Ala Asp Ser Lys Glu Asn Leu lie Met Phe Thr Asn Glu Asp 
195 200 205 

45 Pro Leu Leu Arg Gin Arg Phe Met Glu Leu Thr Glu Lys Leu Phe Ser 
210 215 220 

Asp Ala Arg lie Arg Glu lie Thr His Glu Asn Gly Thr Ser Lys Val 
^ 225 230 235 240 



55 



60 



Tyr Val Asn Ser Lys Lys Ala Leu Lys Leu Val Asn Ser Leu Gly Asn 
245 250 255 

Ala His lie Pro Lys Glu Cys Trp Arg Gly lie Arg Ser Phe Leu Arg 
260 265 270 

Ala Tyr Phe Asp Cys Asn Gly Gly Val Lys Gly Asn Ala lie Val Leu 
275 280 285 
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Ala Thr Ala Ser Lys Glu Met Ser Gin Glu lie Ala Tyr Ala Leu Ala 
290 295 300 

Gly Phe Gly lie lie Ser Arg lie Gin Glu Tyr Arg Val lie lie Ser 
305 310 315 320 

Gly Ser Asp Asn Val Lys Lys Phe Leu Asn Glu He Gly Phe lie Asn 
325 330 335 

Arg Asn Lys Leu Glu Lys Ala Leu Lys Leu Val Lys Lys Asp Asp Pro 
340 345 350 

Gly His Asp Gly Leu Glu lie Asn Tyr Glu Leu lie Ser Tyr Val Lys 
355 360 365 

Asp Arg Leu Arg Leu Ser Phe Phe Asn Asp Lys Arg Ser Trp Ser Tyr 
370 375 380 

Arg Glu Ala Lys Glu He Ser Trp Glu Leu IVlet Lys Glu lie Tyr Tyr 
385 390 395 400 

Arg Leu Asp Glu Leu Glu Lys Leu Lys Glu Ser Leu Ser Arg Gly He 
405 410 415 

Leu lie Asp Trp Asn Glu Val Ala Lys Arg lie Glu Glu Val Ala Glu 
420 425 430 

Glu Thr Gly lie Arg Ala Asp Glu Leu Leu Glu Tyr He Glu Gly Lys 
435 440 445 

Arg Lys Leu Ser Phe Lys Asp Tyr He Lys lie Ala Lys Val Leu Gly 
450 455 460 

He Asp Vai Glu His Thr lie Giu Ala Met Arg Val Phe Ala Arg Lys 
465 470 475 480 

Tyr Ser Ser Tyr Ala Giu He Gly Arg Arg Leu Gly Thr Trp Asn Ser 
485 490 495 

Ser Val Lys Thr He Leu Giu Ser Asn Ala Vai Asn Val Giu He Leu 
500 505 510 

Glu Arg lie Arg Lys He Glu Leu Glu Leu He Giu Glu lie Leu Ser 
515 520 525 

Asp Glu Lys Leu Lys Glu Gly lie Ala Tyr Leu lie Phe Leu Ser Gin 
530 535 540 
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Asn Glu Leu tyr Trp Asp Glu lie Thr Lys Val Glu Glu Leu Arg Gly 
545 550 555 560 

Glu Phe lie lie Tyr Asp Leu His Val Pro Gly Tyr His Asn Phe lie 
565 570 575 

,„ Ala Gly Asn Met Pro Thr Val Val His Asn Thr Thr Ala Ala Leu Ala 
580 585 590 

Leu Ser Arg Glu Leu Phe Gly Glu Asn Trp Arg His Asn Phe Leu Glu 
IS 595 600 605 

Leu Asn Ala Ser Asp Glu Arg Gly lie Asn Val lie Arg Glu Lys Val 
610 615 620 

20 

Lys Glu Phe Ala Arg Thr Lys Pro lie Gly Gly Ala Ser Phe Lys lie 
625 630 635 640 

^ lie Phe Leu Asp Glu Ala Asp Ala Leu Thr Gin Asp Ala Gin Gin Ala 
645 650 655 

^ Leu Arg Arg Thr Met Glu Met Phe Ser Ser Asn Val Arg Phe lie Leu 
660 665 670 

Ser Cys Asn Tyr Ser Ser Lys lie lie Glu Pro lie Gin Ser Arg Cys 
35 675 680 685 

Ala lie Phe Arg Phe Arg Pro Leu Arg Asp Glu Asp lie Ala Lys Arg 
690 695 700 

40 

Leu Arg Tyr lie Ala Glu Asn Glu Gly Leu Glu Leu Thr Glu Glu Gly 
705 710 715 720 

45 Leu Gin Ala lie Leu Tyr lie Ala Glu Gly Asp Met Arg Arg Ala lie 
725 730 735 

Asn lie Leu Gin Ala Ala Ala Ala Leu Asp Lys Lys lie Thr Asp Glu 
^ 740 745 750 

Asn Val Phe Met Val Ala Ser Arg Ala Arg Pro Glu Asp lie Arg Glu 
755 760 765 

Met Met Leu Leu Ala Leu Lys Gly Asn Phe Leu Lys Ala Arg Glu Lys 
770 775 780 

60 

Leu Arg Glu lie Leu Leu Lys Gin Gly Leu Ser Gly Glu Asp Val Leu 
785 790 795 800 
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lie Gin Met His Lys Giu Val Phe Asn Leu Pro tie Asp Giu Pro Thr 
805 810 815 

Lys Val Tyr Leu Ala Asp Lys lie Gly Giu Tyr Asn Phe Arg Leu Val 
820 825 830 

Giu Gly Ala Asn Giu Met lie Gin Leu Giu Ala Leu Leu Ala Gin Phe 
835 840 845 

Thr Leu Val Gly Lys Lys Lys 
850 855 

(2) ANGABEN ZU SEQ ID NO: 5: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 321 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 

(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Methanobacterium thermoautotrophicum 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 5: 

Met lie lie Met Asn Gly Pro Trp Val Giu Lys Tyr Arg Pro Gin Lys 
1 5 10 15 

Leu Asp Asp He Val Gly Gin Giu His lie He Pro Arg Leu Lys Arg 
20 25 30 

Tyr Val Giu Giu Lys Ser Met Pro Asn Leu Met Phe Thr Gly Pro Ala 
35 40 45 

Gly Val Gly Lys Thr Thr Ala Ala Leu Ala Leu Ala Arg Giu lie Leu 
50 55 60 

Gly Giu Tyr Trp Arg Gin Asn Phe Leu Giu Leu Asn Ala Ser Asp Ala 
65 70 75 80 

Arg Gly lie Asp Thr Val Arg Thr Ser lie Lys Asn Phe Cys Arg Leu 
85 90 95 
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Lys Pro Val Gly Ala Pro Phe Arg tie lie Phe Leu Asp Glu Val Asp 
100 105 110 

Asn Met Thr Lys Asp Ala Gin His Ala Leu Arg Arg Glu Met Glu Met 
115 120 125 

10 Tyr Thr Lys Thr Ser Ser Phe lie Leu Ser Cys Asn Tyr Ser Ser Lys 
130 135 140 

Me He Asp Pro lie Gin Ser Arg Cys Ala lie Phe Arg Phe Leu Pro 
15 145 150 155 160 

Leu Lys Gly His Gin lie lie Lys Arg Leu Glu Tyr lie Ala Glu Lys 
165 170 175 

20 

Glu Asn Leu Glu Tyr Glu Ala His Ala Leu Glu Thr lie Val Tyr Phe 
180 185 190 

^ Ala Glu Gly Asp Leu Arg Lys Ala lie Asn Leu Leu Gin Ser ^la Ala 
195 200 205 

3^, Ser Leu Gly Glu Lys lie Thr Glu Ser Ser lie Tyr Asp Val Val Ser 
210 215 220 

Arg Ala Arg Pro Lys Asp Val Arg Lys Met lie Lys Thr lie Leu Asp 
35 225 230 235 240 

Gly Lys Phe Met Glu Ala Arg Asp Met Leu Arg Glu lie Met Val Leu 
245 250 255 

40 

Gin Gly lie Ser Gly Glu Asp Met Val Thr Gin lie Tyr Gin Glu Leu 
260 265 270 

45 Ser Arg Leu Ala Met Glu Gly Glu Val Asp Gly Asp Arg Tyr Val Gly 
275 280 285 

Leu lie Asp Ala lie Gly Glu Tyr Asp Phe Arg He Arg Glu Gly Ala 
^ 290 295 300 

Asn Pro Arg lie Gin Leu Glu Ala Leu Leu Ala Arg Phe Leu Glu His 
55 305 310 315 320 

Ala 
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(2) ANGABEN ZU SEQ ID NO: 6: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 1148 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii).ART DBS MOLEKULS: Protein 

(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Homo sapiens 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 6: 

Met Asp lie Arg Lys Phe Phe Gly Val lie Pro Ser Gly Lys Lys Leu 
1 5 10 15 

Val Ser Glu Thr Val Lys Lys Asn Glu Lys Thr Lys Ser Asp Glu Glu 
20 25 30 

Thr Leu Lys Ala Lys Lys Gly lie Lys Glu lie Lys Val Asn Ser Ser 
35 40 45 

Arg Lys Glu Asp Asp Phe Lys Gin Lys Gin Pro Ser Lys Lys Lys Arg 
50 55 60 

lie lie Tyr Asp Ser Asp Ser Glu Ser Glu Glu Thr Leu Gin Val Lys 
65 70 75 80 

Asn Ala Lys Lys Pro Pro Glu Lys Leu Pro Val Ser Ser Lys Pro Gly 
85 90 95 

Lys lie Ser Arg Gin Asp Pro Val Thr Tyr lie Ser Glu Thr Asp Glu 
100 105 110 

Glu Asp Asp Phe Met Cys Lys Lys Ala Ala Ser Lys Ser Lys Glu Asn 
115 120 125 

Gly Arg Ser Thr Asn Ser His Leu Gly Thr Ser Asn Met Lys Lys Asn 
130 135 140 

Glu Glu Asn Thr Lys Thr Lys Asn Lys Pro Leu Ser Pro lie Lys Leu 
145 150 155 160 



31 



19&(0771A1_L> 



DE 198 40 771 Al 

Thr Pro Thr Ser Val Leu Asp Tyr Phe Gly Thr Gly Ser Val Gin Arg 
165 170 175 

Ser Asn Lys Lys Met Val Ala Ser Lys Arg Lys Glu Leu Ser Gin Asn 
180 185 190 

10 Thr Asp Glu Ser Gly Leu Asn Asp Glu Ala lie Ala Lys Gin Leu Gin 
195 200 205 

Leu Asp Glu Asp Ala Glu Leu Glu Arg Gin Leu His Glu Asp Glu Glu 
15 210 215 220 

Phe Ala Arg Thr Leu Ala Met Leu Asp Glu Glu Pro Lys Thr Lys Lys 
225 230 235 240 

20 

Ala Arg Lys Asp Thr Glu Ala Gly Glu Thr Phe Ser Ser Val Gin Ala 
245 250 255 

2^ Asn Leu Ser Lys Ala Glu Lys His Lys Tyr Pro His Lys Val l^ys Thr 
260 265 270 V. 

Ala Gin Val Ser Asp Glu Arg Lys Ser Tyr Ser Pro Arg Lys Gin Ser 
275 280 285 



35 



Lys Tyr Glu Ser Ser Lys Glu Ser Gin Gin His Ser Lys Ser Ser Ala 
290 295 300 



Asp Lys lie Gly Glu Val Ser Ser Pro Lys Ala Ser Ser Lys Leu Ala 
305 310 315 320 

40 

He Met Lys Arg Lys Lys Glu Ser Ser Tyr Lys Glu lie Glu Pro Val 
325 330 335 

45 Ala Ser Lys Arg Lys Glu Asn Ala lie Lys Leu Lys Gly Glu Thr Lys 
340 345 350 



50 



Thr Pro Lys Lys Thr Lys Ser Ser Pro Ala Lys Lys Glu Ser Val Ser 
355 360 365 



55 



60 



Pro Glu Asp Ser Glu Lys Lys Arg Thr Asn Tyr Gin Ala Tyr Arg Ser 
370 375 380 

Tyr Leu Asn Arg Glu Gly Pro Lys Ala Leu Gly Ser Lys Glu lie Pro 
385 390 395 400 

Lys Gly Ala Glu Asn Cys Leu Glu Gly Leu lie Phe Val lie Thr Gly 
405 410 415 
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Val Leu Glu Ser lie Glu Arg Asp Glu Ala Lys Ser Leu He Glu Arg 
420 425 430 

Tyr Gly Gly Lys Val Thr Gly Asn Val Ser Lys Lys Thr Asn Tyr Leu 
435 440 445 

Val Met Gly Arg Asp Ser Gly Gin Ser Lys Ser Asp Lys Ala Ala Ala 
450 455 460 

Leu Gly Thr Lys lie He Asp Glu Asp Gly Leu Leu Asn Leu He Arg 
465 470 475 480 

Thr Met Pro Gly Lys Lys Ser Lys Tyr Glu He Ala Val Glu Thr Glu 
485 490 495 . 

Met Lys Lys Glu Ser Lys Leu Glu Arg Thr Pro Gin Lys Asn Val Gin 
500 505 510 

Gly Lys Arg Lys He Ser Pro Ser Lys Lys Glu Ser Glu Ser Ly^ Lys 
515 520 525 

Ser Arg Pro Thr Ser Lys Arg Asp Ser Leu Ala Lys Thr He Lys Lys 
530 535 540 

Glu Thr Asp Val Phe Trp Lys Ser Leu Asp Phe Lys Glu Gin Val Ala 
545 550 555 560 

Glu Glu Thr Ser Gly Asp Ser Lys Ala Arg Asn Leu Ala Asp Asp Ser 
565 570 575 

Ser Glu Asn Lys Val Glu Asn Leu Leu Trp Val Asp Lys Tyr Lys Pro 
580 585 590 

Thr Ser Leu Lys Thr He He Gly Gin Gin Gly Asp Gin Ser Cys Ala 
595 600 605 

Asn Lys Leu Leu Arg Trp Leu Arg Asn Trp Gin Lys Ser Ser Ser Glu 
610 615 620 

Asp Lys Lys His Ala Ala Lys Phe Gly Lys Phe Ser Gly Lys Asp Asp 
625 630 635 640 

Gly Ser Ser Phe Lys Ala Ala Leu Leu Ser Gly Pro Pro Gly Val Gly 
645 650 655 

Lys Thr Thr Thr Ala Ser Leu Val Cys Gin Glu Leu Gly Tyr Ser Tyr 
660 665 670 
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Val Glu Leu Asn Ala Ser Asp Thr Arg Ser Lys Ser Ser Leu Lys Ala 
675 680 685 

lie Val Ala Glu Ser Leu Asn Asn Thr Ser lie Lys Gly Phe Tyr Ser 
690 695 700 



lU 



Asn Gly Ala Ala Ser Ser Val Ser Thr Lys His Ala Leu lie Met Asp 
705 710 715 720 



IS 



Glu Val Asp Gly Met Ala Gly Asn Glu Asp Arg Gly Gly lie Gin Glu 
725 730 735 



20 



Leu lie Gly Leu lie Lys His Thr Lys lie Pro lie lie Cys Met Cys 
740 745 750 

Asn Asp Arg Asn His Pro Lys lie Arg Ser Leu Val His Tyr Cys Phe 
755 760 765 



25 



Asp Leu Arg Phe Gin Arg Pro Arg Val Glu Gin lie Lys Gly Ala Met 
770 775 780 



30 



Met Ser lie Ala Phe Lys Glu Gly Leu Lys lie Pro Pro Pro Ala Met 
785 790 795 800 



35 



Asn Glu lie lie Leu Gly Ala Asn Gin Asp He Arg Gin Val Leu His 
805 810 815 



Asn Leu Ser Met Trp Cys Ala Arg Ser Lys Ala Leu Thr Tyr Asp Gin 
820 825 830 

40 

Ala Lys Ala Asp Ser His Arg Ala Lys Lys Asp lie Lys Met Gly Pro 
835 840 845 

45 Phe Asp Val Ala Arg Lys Val Phe Ala Ala Gly Glu Glu Thr Ala His 
850 855 860 



50 



Met Ser Leu Val Asp Lys Ser Asp Leu Phe Phe His Asp Tyr Ser lie 
865 870 875 880 



55 



Ala Pro Leu Phe Val Gin Glu Asn Tyr lie His Val Lys Pro Val Ala 
885 890 895 

Ala Gly Gly Asp Met Lys Lys His Leu Met Leu Leu Ser Arg Ala Ala 
900 905 910 

Asp Ser lie Cys Asp Gly Asp Leu Val Asp Ser Gin lie Arg Ser Lys 
915 920 925 
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Gin Asn Trp Ser Leu Leu Pro Ala Gin Ala He Tyr Ala Ser Val Leu 
930 935 940 

Pro Gly Glu Leu Met Arg Gly Tyr Met Thr Gin Phe Pro Thr Phe Pro 
945 950 955 960 

Ser Trp Leu Gly Lys His Ser Ser Thr Gly Lys His Asp Arg lie Val 
965 970 975 

Gin Asp Leu Ala Leu His Met Ser Leu Arg Thr Tyr Ser Ser Lys Arg 
980 985 990 

Thr Val Asn Met Asp Tyr Leu Ser Leu Leu Arg Asp Ala Leu Val Gin 
995 1000 1005 

Pro Leu Thr Ser Gin Gly Val Asp Gly Val Gin Asp Val Val Ala Leu 
1010 1015 1020 

Met Asp Thr Tyr Tyr Leu Met Lys Glu Asp Phe Glu A^n lie Met Glu 
1025 1030 1035 1040 

He Ser Ser Trp Gly Gly Lys Pro Ser Pro Phe Ser Lys Leu Asp Pro 
1045 1050 1055 

Lys Val Lys Ala Ala Phe Thr Arg Ala Tyr Asn Lys Glu Ala His Leu 
1060 1065 1070 

Thr Pro Tyr Ser Leu Gin Ala He Lys Ala Ser Arg His Ser Thr Ser 
1075 1080 1085 

Pro Ser Leu Asp Ser Glu Tyr Asn Glu Glu Leu Asn Glu Asp Asp Ser 
1090 1095 1100 

Gin Ser Asp Glu Lys Asp Gin Asp Ala He Glu Thr Asp Ala Met He 
1105 1110 1115 1120 

Lys Lys Lys Thr Lys Ser Ser Lys Pro Ser Lys Pro Glu Lys Asp Lys 
1125 1130 1135 

Glu Pro Arg Lys Gly Lys Gly Lys Ser Ser Lys Lys 
1140 1145 

(2) ANGABEN ZU SEQ ID NO: 7: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 479 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 
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(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 

(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Archaeoglobus fulgidus 

10 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 7: 

15 

Met Leu Trp Val GIu Lys Tyr Arg Pro Lys Thr Leu Glu Glu Val Val 
1 5 10 15 

20 Ala Asp Lys Ser lie lie Thr Arg Val lie Lys Trp Ala Lys Ser Trp 
20 25 30 

Lys Arg Gly Ser Lys Pro Leu Leu Leu Ala Gly Pro Pro Gly Val GIv 
35 40 45 : 

Lys Thr Ser Leu Ala Leu Ala Leu Ala Asn Thr Met Gly Trp Glu Ala 
30 50 55 60 

Val Glu Leu Asn Ala Ser Asp Gin Arg Ser Trp Arg Val lie Glu Arg 
65 70 75 80 

35 

lie Val Gly Glu Gly Ala Phe Asn Glu Thr lie Ser Asp Glu Gly Glu 
85 90 95 

40 Phe Leu Ser Ser Arg lie Gly Lys Leu Lys Leu lie lie Leu Asp Glu 
100 105 no 

Val Asp Asn He His Lys Lys Glu Asp Val Gly Gly Glu Ala Ala Leu 
115 120 125 

lie Arg Leu lie Lys Arg Lys Pro Ala Gin Pro Leu lie Leu lie Ala 
130 135 140 

50 

Asn Asp Pro Tyr Lys Leu Ser Pro Glu Leu Arg Asn Leu Cys Glu Met 
145 150 155 160 

lie Asn Phe Lys Arg Leu Thr Lys Gin Gin Val Ala Arg Val Leu Glu 
165 170 175 

^ Arg lie Ala Leu Lys Glu Gly lie Lys Val Asp Lys Ser Val Leu Leu 
180 185 190 



55 
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Lys He Ala Glu Asn Ala Gly Gly Asp Leu Arg Ala Aia He Asn Asp 
195 200 205 

Phe Gin Ala Leu Ala Glu Gly Lys Glu Glu Leu Lys Pro Glu Asp Val ^ 
210 215 220 

Phe Leu Thr Lys Arg Thr Gin Glu Lys Asp He Phe Arg Val Met Gin 
225 230 235 240 

Met He Phe Lys Thr Lys Asn Pro Ala Val Tyr Asn Glu Ala Met Leu 

245 250 255 ,5 

Leu Asp Glu Ser Pro Glu Asp Val He His Trp Val Asp Glu Asn Leu 
260 265 270 

20 

Pro Leu Glu Tyr Ser Gly Val Glu Leu Val Asn Ala Tyr Glu Ala Leu 
275 280 285 

Ser Arg Ala Asp lie Phe Leu Gly Arg Val Arg Arg Arg Gin ^he Tyr ^ 
290 295 300 

Arg Leu Trp Lys Tyr Ala Ser Tyr Leu Met Thr Val Gly Val Gin Gin 

305 310 315 320 ^ 

Met Lys Glu Glu Pro Lys Lys Gly Phe Thr Arg Tyr Arg Arg Pro Ala 

325 330 335 3. 

Val Trp Gin Met Leu Phe Gin Leu Arg Gin Lys Arg Glu Met Thr Arg 
340 345 350 

40 

Lys He Leu Glu Lys lie Gly Lys Tyr Ser His Leu Ser Met Arg Lys 
355 360 365 

Ala Arg Thr Glu Met Phe Pro Val He Lys Leu Leu Leu Lys Glu Leu 45 
370 375 380 

Asp Val Asp Lys Ala Ala Thr He Ala Ala Phe Tyr Glu Phe Thr Lys 

385 390 395 400 ^o 

Glu Glu Leu Glu Phe Leu Val Gly Glu Lys Gly Asp Glu He Trp Lys 
405 410 415 

... . 55 

Tyr Val Glu Lys His Gly Met His Arg He Glu Asp Glu Thr Phe Leu 
420 425 430 

Glu Ser Phe Val Lys Ala Glu Lys Glu Glu Lys Glu Glu Ser Val Glu 
435 440 445 
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Glu Val Ala Glu Glu Lys Pro Glu G\u Glu Arg Glu GIu Pro Arg Ala 
450 455 460 

Arg Lys Lys Ala Gly Lys Asn Leu Thr Leu Asp Ser Phe Phe Ser 
465 470 475 

(2) ANGABEN ZU SEQ ID NO: 8: 

(i) SEQUENZKENNZEICHEN: 

(A) LAIMGE: 516 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

lii) ART DES MOLEKOLS: Protein 

(vi) URSPRUNLICHE HERKUNFT: 

(A) ORGANISMUS: Methanococcus jannaschii 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 8: 

Met Leu Ser Trp Val Glu Lys Tyr Arg Pro Lys Ser Leu Lys Asp Val 
15 .10 15 

Ala Gly His Glu Lys Val Lys Glu Lys Leu Lys Thr Trp lie Glu Ser 
20 25 30 

Tyr Leu Lys Gly Glu Thr Pro Lys Pro lie Leu Leu Val Gly Pro Pro 
35 40 45 

Gly Cys Gly Lys Thr Thr Leu Ala Tyr Ala Leu Ala Asn Asp Tyr Gly 
50 55 60 

Phe Glu Val lie Glu Leu Asn Ala Ser Asp Lys Arg Asn Ser Ser Ala 
65 70 75 80 

lie Lys Lys Val Val Gly His Ala Ala Thr Ser Ser Ser lie Phe Gly 
85 90 95 

Lys Lys Phe Leu lie Val Leu Asp Glu Val Asp Gly lie Ser Gly Lys 
100 105 110 

Glu Asp Ala Gly Gly Val Ser Glu Leu lie Lys Val lie Lys Lys Ala 
115 120 125 
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Lys Asn Pro lie lie Leu Thr Ala Asn Asp Ala Tyr Ala Pro Ser lie 
130 135 140 

Arg Ser Leu Leu Pro Tyr Val Glu Val lie Gin Leu Asn Pro Val His 
145 150 155 160 

Thr Asn Ser Val Tyr Lys Val Leu Lys Lys lie Ala Glu Lys Glu Gly 
165 170 175 



Leu Asp Val Asp Asp Lys Thr Leu Lys Met lie Ala Gin His Ser Ala 
180 185 190 



Gly Asp Leu Arg Ser Ala He Asn Asp Leu Glu Ala Leu Ala Leu Ser 
195 200 205 

Gly Asp Leu Ser Tyr Glu Ala Ala Gin Lys Leu Pro Asp Arg Lys Arg 
210 215 220 

Glu Ala Asn fie Phe Asp Ala Leu Arg Val lie Leu Lys Thr Tjir His 
225 230 235 240 

Tyr Gly lie Ala Thr Thr Ala Leu Met Asn Val Asp Glu Thr Pro Asp 
245 250 255 

Val Val lie Glu Trp lie Ala Glu Asn Val Pro Lys Glu Tyr Glu Lys 
260 265 270 

Pro Glu Glu Val Ala Arg Ala Phe Glu Tyr Leu Ser Lys Ala Asp Arg 
275 280 285 

Tyr Leu Gly Arg Val Met Arg Arg Gin Asn Tyr Ser Phe Trp Lys Tyr 
290 295 300 

Ala Thr Thr Leu Met Thr Ala Gly Val Ala Leu Ser Lys Asp Glu Lys 
305 310 315 320 

Tyr Arg Lys Trp Thr Pro Tyr Ser Tyr Pro Lys lie Phe Arg Leu Leu 
325 330 335 

Thr Lys Thr Lys Ala Glu Arg Glu lie Leu Asn Lys lie Leu Lys Lys 
340 345 350 

lie Gly Glu Lys Thr His Thr Ser Ser Lys Arg Ala Arg Phe Asp Leu 
355 360 365 

Gin Met Leu Lys Leu Leu Ala Lys Glu Asn Pro Ser Val Ala Ala Asp 
370 375 380 
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30 



35 
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55 
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Leu Val Asp Tyr Phe Glu lie Lys Glu Asp Glu Leu Lys Val Leu Val 
385 390 395 400 

Gly Asp Lys Leu Ala Ser Glu lie Leu Lys lie Leu Lys Glu Lys Lys 
405 410 415 

Lys Leu Glu Arg Lys Lys Lys Lys Glu Lys Glu Lys Leu Glu Lys Glu 
420 425 430 



Lys Lys Lys Glu Glu Lys Ala Lys Glu Lys Gin Ser Asn Leu lie lie 
IS 435 440 445 

Gin Pro Lys Glu lie Lys Glu Glu Val Lys Ala Glu Val Glu Lys Lys 
450 455 460 



Glu Glu Val Lys Glu Lys lie Val Glu Lys Pro Lys Ala Glu Glu Val 
"^65 470 475 480 

Lys Glu Lys Ser Lys Thr Glu Glu Lys Glu Thr Lys Lys A>p Lys Lys 
485 490 495 

Lys Gly Lys Lys Lys Lys Glu Asp Lys Gly Lys Gin Leu Thr Leu Asp 
500 505 510 

Ala Phe Phe Lys 
515 

12) ANGABEN ZU SEQ ID NO: 9: 

40 li) SEQUENZKENNZEICHEN: 

(A) LANGE: 468 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 
45 (D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 



(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Pyrococcus horikoshii 



(xl) SEQUENZBESCHREIBUNG: SEQ ID NO: 9: 

Met Pro Asp Val Pro Trp lie Glu Lys Tyr Arg Pro Arg Lys Leu Ser 
15 10 15 
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Glu lie Val Asn Gin Glu Gin Ala Leu Gtu Lys Vai Arg Ala Trp lie 
20 25 30 



Glu Ser Trp Leu His Gly Asn Pro Pro Lys Lys Lys Ala Leu Leu Leu 
35 40 45 

Ala Gly Pro Pro Gly Ser Gly Lys Thr Thr Thr Val Tyr Ala Leu Ala 
50 55 60 

His Glu Tyr Asn Phe Glu Val lie Glu Leu Asn Ala Ser Asp Glu Arg 
65 70 75 80 

Thr Tyr Asn Lys lie Ala Arg Tyr Val Gin Ala Ala Tyr Thr Met Asp 
85 90 95 

He Met Gly Lys Arg Arg Lys He He Phe Leu Asp Glu Ala Asp Asn 
100 105 110 

■w. 

lie Glu Pro Ser Gly Ala Pro Glu lie Ala Lys Leu lie Asp Lys ^la 
115 120 125 

Arg Asn Pro lie lie Met Ala Ala Asn His Tyr Trp Glu Val Pro Lys 
130 135 140 

Glu He Arg Asp Arg Ala Glu Leu Val Glu Tyr Lys Arg Leu Asn Gin 
145 150 155 160 

Arg Asp Val lie Ser Ala Leu Val Arg lie Leu Lys Arg Glu Gly He 
165 170 175 

Thr Val Pro Lys Glu He Leu Thr Glu He Ala Lys Arg Ser Ser Gly 
180 185 190 

Asp Leu Arg Ala Ala lie Asn Asp Leu Gin Thr He Val Ala Gly Gly 
1 95 200 205 

Tyr Glu Asp Ala Lys Tyr Val Leu Ala Tyr Arg Asp Val Glu Lys Thr 
210 215 220 

Val Phe Gin Ser Leu Gly Met Val Phe Ser Ser Asp Asn Ala Lys Arg 
225 230 235 240 

Ala Lys Leu Ala Leu Met Asn Leu Asp Met Ser Pro Asp Glu Phe Leu 
245 250 255 

Leu Trp Val Asp Glu Asn He Pro His Met Tyr Leu Lys Pro Glu Glu 
260 265 270 
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Met Ala Arg Ala Tyr Glu Ala lie Ser Arg Ala Asp lie Tyr Leu Gly 
275 280 285 



Arg Ala Gin Arg Thr Gly Asn Tyr Ser Leu Trp Lys Tyr Ala lie Asp 
290 295 300 

Met Met Thr Ala Gly Val Ala Val Ala Gly Thr Lys Lys Lys Gly Phe 
305 310 315 320 

Ala Lys Phe Tyr Pro Pro Asn Thr Leu Lys Met Leu Ala Glu Ser Lys 
325 330 335 

Glu Glu Arg Ser lie Arg Asp Ser lie lie Lys Lys He Met Lys Glu 
340 345 350 

Met His Met Ser Lys Leu Glu Ala Leu Glu Thr Met Lys lie Leu Arg 
355 360 365 

Thr lie Phe Glu Asn Asn Leu Asp Leu Ala Ala His Phe Thr Val Phe 
370 375 380 

Leu Glu Leu Thr Glu Lys Glu Val Glu Phe Leu Ala Gly Lys Glu Lys 
385 390 395 400 

Ala Gly Thr lie Trp Gly Lys Thr Leu Ser lie Arg Arg Arg lie Lys 
405 410 415 

Glu Thr Glu Lys lie Glu Glu Lys Ala Val Glu Glu Lys Val Glu Glu 
420 425 430 

Glu Glu Ala Glu Glu Glu Glu Glu Glu Glu Arg Lys Glu Glu Glu Lys 
435 440 445 

Pro Lys Ala Glu Lys Lys Lys Gly Lys Gin Val Thr Leu Phe Asp Phe 
450 455 460 

lie Lys Lys Asn 
465 



(2) ANGABEN ZU SEQ ID NO: 10: 

(i) SEQUEN2KENN2EICHEN: 

(A) LANGE: 479 Aminosauren 

(B) ART: Aminosgure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(li) ART DES MOLEKULS: Protein 
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(vi) URSPRUNLICHE HERKUNFT: 

(A) ORGANISMUS: Methanobacterium thermoautotrrophicum 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 10: 

Met Ser Trp Thr Glu Lys Tyr Arg Pro Gly Ser Phe Asp Glu Val Val 
1 5 10 15 

Gly Asn Gin Lys Val lie Ala Glu lie Lys Glu Trp lie Lys Ala Trp 
20 25 30 

Lys Ala Gly Lys Pro Gin Lys Pro Leu Leu Leu Val Gly Pro Pro Gly 
35 40 45 

Thr Gly Lys Thr Thr Leu Ala His lie lie Gly Lys Glu Phe Ser Asp 
50 55 60 ^ 

Thr Leu Glu Leu Asn Ala Ser Asp Arg Arg Ser Gin Asp AiaXeu Met 
65 70 75 80 

Arg Ser Ala Gly Glu Ala Ser Ala Thr Arg Ser Leu Phe Asn His Asp 
85 90 95 

Leu Lys Leu lie lie Leu Asp Glu Val Asp Gly lie His Gly Asn Glu 
100 105 110 

Asp Arg Gly Gly Val Gin Ala lie Asn Arg lie lie Lys Glu Ser Arg 
115 120 125 

His Pro Met Val Leu Thr Ala Asn Asp Pro Tyr Ser Lys Arg Leu Gin 
130 135 140 

Ser lie Lys Pro Arg Cys Arg Val Leu Asn Leu Arg Lys Val His Thr 
145 150 155 160 

Ser Ser lie Ala Ala Ala Leu Arg Arg He Cys Arg Ala Glu Gly lie 
165 170 175 

Glu Cys Pro Asp Asp Val Leu Arg Glu Leu Ala Lys Arg Ser Arg Gly 
180 185 190 

Asp Leu Arg Ser Ala lie Asn Asp Leu Glu Ala Met Ala Glu Gly Glu 
195 200 205 

Glu Arg lie Gly Glu Glu Leu Leu Lys Met Gly Glu Lys Asp Ala Thr 
210 215 220 
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Ser Asn Leu Phe Asp Ala Val Arg Ala Val Leu Lys Ser Arg Asp Val 
225 230 235 240 

Ser Lys Val Arg Glu Ala Met Arg Val Asp Asp Asp Pro Thr Leu Val 
245 250 255 

,„ Leu Glu Phe lie Ala Glu Asn Val Pro Arg Glu Tyr Glu Lys Pro Asn 
260 265 270 

Glu lie Ser Arg Ala Tyr Asp Met Leu Ser Arg Ala Asp lie Phe Phe 
IS 275 280 285 

Gly Arg Ala Val Arg Thr Arg Asn Tyr Thr Tyr Trp Arg Tyr Ala Ser 
290 295 300 

20 

Glu Leu Met Gly Pro Gly Val Ala Leu Ala Lys Asp Lys Thr Tyr Arg 
305 310 315 320 

^ Lys Phe Val Arg Tyr Thr Gly Ser Ser Ser Phe Arg lie Leu Qly Lys 
325 330 335 

^ Thr Arg Lys Gin Arg Ser Leu Arg Asp Ser Val Ala Ala Lys Met Ala 
340 345 350 

Gly Lys Met His lie Ser Pro Lys Val Ala lie Ser Met Phe Pro Tyr 
35 355 360 365 

Met Glu lie Leu Phe Glu Asn Asp Glu Met Ala Tyr Asp lie Ser Glu 
370 375 380 

40 

Phe Leu Glu Leu Arg Asp Glu Glu lie Lys Leu Phe Arg Lys Arg Lys 
385 390 395 400 

45 lie Lys Ala Pro Lys Arg Lys Lys Thr Pro Arg Lys Ala Glu lie Lys 
405 410 415 

Val Gly Pro Leu Tyr Ser Gin Lys Lys Asp Lys Gly Ala Asp Lys Ser 
420 425 430 

lie Asn Asp Lys Ala Thr Asp Lys Ser Ala Lys Thr Pro lie Lys Ser 
435 440 445 

Ser Lys Lys Asp Asp Arg Pro Arg Asp Glu Ser Ser Ser Ser Ser Asp 
450 455 460 

60 

Asp Lys Lys Pro Lys Glu Lys Gin Thr Ser Leu Phe Gin Phe Ser 
465 470 475 
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(2) ANGABEN ZU SEQ ID NO: 1 1: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 261 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(il) ART DES MOLEKULS: Protein 

(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Homo sapiens 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 11: 
Met Phe Glu Ala Arg Leu Val Gin Gly Ser He Leu Lys Lys Val Leu 

1 5 10 15 ^; 

Glu Ala Leu Lys Asp Leu lie Asn Glu Ala Cys Trp Asp lie Ser Ser 
20 25 30 

Ser Gly Val Asn Leu Gin Ser Met Asp Ser Ser His Val Ser Leu Val 
35 40 45 

Gin Leu Thr Leu Arg Ser Glu Gly Phe Asp Thr Tyr Arg Cys Asp Arg 
50 55 60 

Asn Leu Ala Met Gly Val Asn Leu Thr Ser Met Ser Lys lie Leu Lys 
65 70 .75 80 

Cys Ala Gly Asn Glu Asp lie lie Thr Leu Arg Ala Glu Asp Asn Ala 
85 90 95 

Asp Thr Leu Ala Leu Val Phe Glu Ala Pro Asn Gin Glu Lys Val Ser 
100 105 110 

Asp Tyr Glu Met Lys Leu Met Asp Leu Asp Val Glu Gin Leu Gly lie 
115 120 125 

Pro Glu Gin Glu Tyr Ser Cys Val Val Lys Met Pro Ser Gly Glu Phe - . 
130 135 140 

Ala Arg lie Cys Arg Asp Leu Ser His lie Gly Asp Ala Val Val lie 
145 150 155 160 
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Ser Cys Ala Lys Asp Gly Val Lys Phe Ser Ala Ser Gly Glu Leu Gly 
165 170 175 

Asn Gly Asn lie Lys Leu Ser Gin Thr Ser Asn Val Asp Lys Glu Glu 
180 185 190 

Glu Ala Val Thr He Glu Met Asn Glu Pro Val Gin Leu Thr Phe Ala 
195 200 205 

Leu Arg Tyr Leu Asn Phe Phe Thr Lys Ala Thr Pro Leu Ser Ser Thr 
210 215 220 

Val Thr Leu Ser Met Ser Ala Asp Val Pro Leu Val Val Glu Tyr Lys 
225 230 235 240 

lie Ala Asp Met Gly His Leu Lys Tyr Tyr Leu Ala Pro Lys He Glu 
245 250 255 

Asp Glu Glu Gly Ser 
260 

(2) ANGABEN ZU SEQ ID NO: 12: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 245 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKOLS: Protein 

(vi) URSPRUNLICHE HERKUNFT: 

(A) ORGANISMUS: Archaeoglobus fulgidus 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 12: 

Met He Asp Val lie Met Thr Gly Glu Leu Leu Lys Thr Val Thr Arg 
15 10 15 

Ala lie Val Ala Leu Val Ser Glu Ala Arg He His Phe Leu Glu Lys 
20 25 30 

Gly Leu His Ser Arg Ala Val Asp Pro Ala Asn Val Ala Met Val He 
35 40 45 
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Val Asp lie Pro Lys Asp Ser Phe Glu Val Tyr Asn lie Asp Giu Glu 
50 55 60 

Lys Thr lie Gly Val Asp Met Asp Arg He Phe Asp lie Ser Lys Ser 
65 70 75 80 

lie Ser Thr Lys Asp Leu Val Glu Leu lie Val Glu Asp Glu Ser Thr 
85 90 95 

Leu Lys Val Lys Phe Gly Ser Val Glu Tyr Lys Val Ala Leu lie Asp 
100 105 110 

Pro Ser Ala lie Arg Lys Glu Pro Arg lie Pro Glu Leu Glu Leu Pro 
115 120 125 

Ala Lys lie Val Met Asp Ala Gly Glu Phe Lys Lys Ala lie Ala Ala 
130 135 140 

Ala Asp Lys lie Ser Asp Gin Val lie Phe Arg Ser Asp Lys Glu Gly 
145 150 155 160 

Phe Arg lie Glu Ala Lys Gly Asp Val Asp Ser lie Val Phe His Met 
165 170 175 

Thr Glu Thr Glu Leu lie Glu Phe Asn Gly Gly Glu Ala Arg Ser Met 
180 185 190 

Phe Ser Val Asp Tyr Leu Lys Glu Phe Cys Lys Val Ala Gly Ser Gly 
195 200 205 

Asp Leu Leu Thr lie His Leu Gly Thr Asn Tyr Pro Val Arg Leu Val 
210 215 220 

Phe Glu Leu Val Gly Gly Arg Ala Lys Val Glu Tyr lie Leu Ala Pro 
225 230 235 240 

Arg lie Glu Ser Glu 
245 

(2) ANGABEN ZU SEQ ID NO: 13: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 247 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 
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(vi) URSPRUNLICHE HERKUNFT: 

(A) ORGANISMUS: Methanococcus jannaschii 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 13: 

10 

Met Phe Arg Gly Val Met Glu Ser Ala Lys Glu Phe Lys Lys Val Val 
1 5 10 15 

15 Asp Thr lie Ser Thr Leu Leu Asp Glu lie Cys Phe Glu Val Asp Glu 
20 25 30 

Glu Gly lie Lys Ala Ser Ala Met Asp Pro Ser His Val Ala Leu Val 
35 40 45 

Ser Leu Glu lie Pro Arg Leu Ala Phe Glu Glu Tyr Glu Ala Asp Ser 
50 55 60 

25 

His Asp He Gly He Asp Leu Glu Ala Phe Lys Lys Val Met Xfen Arg 
65 70 75 80 

30 

Ala Lys Ala Lys Asp Arg Leu lie Leu Glu Leu Asp Glu Glu Lys Asn 
85 90 95 

3$ Lys Leu Asn Val He Phe Glu Asn Thr Gly Lys Arg Lys Phe Ser Leu 
100 105 110 

Ala Leu Leu Asp He Ser Ala Ser Ser Val Lys Val Pro Glu He Glu 
40 115 120 125 

Tyr Pro Asn Val He Met He Lys Gly Asp Ala Phe Lys Glu Ala Leu 
130 135 140 

45 

Lys Asp Ala Asp Leu Phe Ser Asp Tyr Val He Leu Lys Ala Asp Glu 
145 150 155 160 

^ Asp Lys Phe Val He His Ala Lys Gly Asp Leu Asn Glu Asn Glu Ala 
165 170 175 

He Phe Glu Lys Asp Ser Ser Ala He He Ser Leu Glu Val Lys Glu 
180 185 190 

Glu Ala Lys Ser Ala Phe Asn Leu Asp Tyr Leu Met Asp Met Val Lys 
60 195 200 205 

Gly Val Ser Ser Gly Asp lie He Lys He Tyr Leu Gly Asn Asp Met 
210 215 220 

6S 
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Pro Leu Lys Leu Glu Tyr Ser lie Ala Gly Val Asn Leu Thr Phe Leu 
225 230 235 240 

Leu Ala Pro Arg lie Glu Gly 
245 

(2) ANGABEN 2U SEQ ID NO: 14: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 249 Amlnosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 
ID) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 

(vl) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Pyrococcus horikoshii - 



(xl) SEQUEN2BESCHREIBUNG: SEQ ID NO: 14: 

Met Pro Phe Glu He Val Phe Glu Gly Ala Lys Glu Phe Ala Gin Leu 
15 10 15 

lie Glu Thr Ala Ser Arg Leu lie Asp Glu Ala Ala Phe Lys Val Thr 
20 25 30 

Glu Glu Gly He Ser Met Arg Ala Met Asp Pro Ser Arg Val Val Leu 
35 40 45 

lie Asp Leu Asn Leu Pro Ser Ser lie Phe Ser Lys Tyr Glu Val Asd 
50 55 60 

Gly Glu Glu Thr lie Gly Val Asn Met Asp His Leu Lys Lys Val Leu 
65 70 75 80 

Lys Arg Gly Lys Ala Lys Asp Thr Leu lie Leu Arg Lys Gly Glu Glu 
85 90 95 

Asn Phe Leu Glu lie Ser Leu Gin Gly Thr Ala Thr Arg Thr Phe Arg 
100 105 110 

Leu Pro Leu lie Asp Val Glu Glu lie Glu Val Glu Leu Pro Asp Leu 
115 120 125 
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Pro Tyr Thr Ala Lys Val Val Val Leu Gly Glu Val Leu Lys Glu Ala 
130 135 140 

Val Lys Asp Ala Ser Leu Val Ser Asp Ser He Lys Phe Met Ala Lys 
145 ISO 155 160 

Glu Asn Glu Phe lie Met Arg Ala Glu Gly Glu Thr Gin Glu Val Glu 
165 170 175 

Val Lys Leu Thr Leu Glu Asp Glu Gly Leu Leu Asp lie Glu Val Gin 
180 185 190 

Glu Glu Thr Lys Ser Ala Tyr Gly Val Ser Tyr Leu Ala Asp Met Val 
195 200 205 

Lys Gly lie Gly Lys Ala Asp Glu Val Thr Met Arg Phe Gly Asn Glu 
210 215 220 

Met Pro Met Gin Met Glu Tyr Tyr lie Arg Asp Glu Gly Arg Leu Thr 
225 230 235 240 V 

Phe Leu Leu Ala Pro Arg Val Glu Glu 
245 

(2) ANGABEN ZU SEQ ID NO: 15: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 244 Aminosauren 

IB) ART: Aminosaure 

IC) STRANGFORM: EInzelstrang 
(D) TOPOLOGIE: linear 

Hi) ART DES MOLEKOLS: Protein 

(vi) URSPRUNLICHE HERKUNFT: 

(A) ORGANISMUS: Methanobacterium thermoautotrophicum 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 1 5: 

Met Phe Lys Ala Glu Leu Asn Asp Pro Asn lie Leu Arg Thr Ser Phe 
15 10 15 

Asp Ala lie Ser Ser lie Val Asp Glu Val Gin He Gin Leu Ser Ala 
20 25 30 
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Glu Gly Leu Arg Leu Asp Ala Leu Asp Arg Ser His lie Thr Tyr Val 
35 40 45 

His Leu Glu Leu Lys Ala Glu Leu Phe Asp Glu Tyr Val Cys Asp Glu 
50 55 60 

Pro Glu Arg lie Asn Val Asp Thr Glu Glu Leu Met Lys Val Leu Lys 
65 70 75 80 

Arg Ala Lys Ala Asn Asp Arg Val lie Leu Ser Thr Asp Glu Gly Asn 
85 90 95 

Leu lie lie Gin Phe Glu Gly Glu Ala Val Arg Thr Phe Lys lie Arg 
100 105 110 

Leu lie Asp He Glu Tyr Glu Thr Pro Ser Pro Pro Glu lie Glu Tyr 
115 120 125 

Glu Asn Glu Phe Glu Val Pro Phe Gin Leu Leu Lys Asp' Ser lie Ala 
.130 135 140 

Asp lie Asp lie Phe Ser Asp Lys lie Thr Phe Arg Val Asp Glu Asp 
145 150 155 160 

Arg Phe lie Ala Ser Ala Glu Gly Glu Phe Gly Asp Ala Gin lie Glu 
165 170 175 

Tyr Leu His Gly Glu Arg lie Asp Lys Pro Ala Arg Ser lie Tyr Ser 
180 185 190 

Leu Asp Lys lie Lys Glu Met Leu Lys Ala Asp Lys Phe Ser Glu Thr 
1 95 200 205 

Ala lie He Asn Leu Gly Asp Asp Met Pro Leu Lys Leu Thr Leu Lys 
210 215 220 

Met Ala Ser Lys Glu Gly Glu Leu Ser Phe Leu Leu Ala Pro Arg lie 
225 230 235 240 

Glu Ala Glu Glu 



(2) ANGABEN ZU SEQ ID NO: 16: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 469 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 
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(D) TOPOLOGIE: linear 

lii) ART DES MOLEKULS: Protein 

(vi) URSPRUNLICHE HERKUNFT: 

(A) ORGANISMUS: Homo sapiens 

(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 16: 

Met Phe Set Glu Gin Ala Ala Gin Arg Ala His Thr Leu Leu Ser Pro 
15 10 15 

20 Pro Ser Ala Asn Asn Ala Thr Phe Ala Arg Val Pro Val Ala Thr Tyr 
20 25 30 



10 



IS 



25 



30 



35 



Thr Asn Ser Ser Gin Pro Phe Arg Leu Gly Glu Arg Ser Phe Ser Arg 
35 40 45 

Gin Tyr Ala His lie Tyr Ala Thr Arg Leu lie Gin Met Arg Pro Phe 
50 55 60 

Leu Glu Asn Arg Ala Gin Gin His Trp Gly Ser Gly Val Gly Val Lys 
65 70 75 80 

Lys Leu Cys Glu Leu Gin Pro Glu Glu Lys Cys Cys Val Val Gly Thr 
85 90 95 

40 Leu Phe Lys Ala Met Pro Leu Gin Pro Ser lie Leu Arg Glu Val Ser 
100 105 110 

Glu Glu His Asn Leu Leu Pro Gin Pro Pro Arg Ser Lys Tyr lie His 
« 115 120 125 

Pro Asp Asp Glu Leu Val Leu Glu Asp Glu Leu Gin Arg lie Lys Leu 
130 135 140 
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55 



60 



65 



Lys Gly Thr lie Asp Val Ser Lys Leu Val Thr Gly Thr Val Leu Ala 
145 150 155 160 

Val Phe Gly Ser Val Arg Asp Asp Gly Lys Phe Leu Val Glu Asp Tyr 
165 170 175 

Cys Phe Ala Asp Leu Ala Pro Gin Lys Pro Ala Pro Pro Leu Asp Thr 
180 185 190 
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Asp Arg Phe Val Leu Leu Val Ser Gly Leu Gly Leu Gly Gly Gly Gly 
195 200 205 

Gly Glu Ser Leu Leu Gly Thr Gin Leu Leu Val Asp Val Val Thr Gly 
210 215 220 

Gin Leu Gly Asp Glu Gly Glu Gin Cys Ser Ala Ala His Val Ser Arg 
225 230 235 240 

Val lie Leu Ala Gly Asn Leu Leu Ser His Ser Thr Gin Ser Arg Asp 
245 250 255 

Ser lie Asn Lys Ala Lys Tyr Leu Thr Lys Lys Thr Gin Ala Ala Ser 
260 265 270 



Val Glu Ala Val Lys Met Leu Asp Glu lie Leu Leu Gin Leu Ser Ala 
275 280 285 

Ser Val Pro Val Asp Val Met Pro Gly Glu Phe Asp Pro Thr- Asn Tyr 
290 295 300 

Thr Leu Pro Gin Gin Pro Leu His Pro Cys Met Phe Pro Leu Ala Thr 
305 310 315 320 

Ala Tyr Ser Thr Leu Gin Leu Val Thr Asn Pro Tyr Gin Ala Thr lie 
325 330 335 

Asp Gly Val Arg Phe Leu Gly Thr Ser Gly Gin Asn Val Ser Asp lie 
340 345 350 

Phe Arg Tyr Ser Ser Met Glu Asp His Leu Glu lie Leu Glu Trp Thr 
355 360 365 

Leu Arg Val Arg His lie Ser Pro Thr Ala Pro Asp Thr Leu Gly Cys 
370 375 380 

Tyr Pro Phe Tyr Lys Thr Asp Pro Phe He Phe Pro Glu Cys Pro His 
385 390 395 400 

Val Tyr Phe Cys Gly Asn Thr Pro Ser Phe Gly Ser Lys lie lie Arg 
405 410 415 

Gly Pro Glu Asp Gin Thr Val Leu Leu Val Thr Val Pro Asp Phe Ser 
420 425 430 

Ala Thr Gin Thr Ala Cys Leu Val Asn Leu Arg Ser Leu Ala Cys Gin 
435 440 445 
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Pro lie Ser Phe Ser Gly Phe Gly Ala Glu Asp Asp Asp Leu Gly Gly 
450 455 450 

Leu Gly Leu Gly Pro 
465 

(2) ANGABEN ZU SEQ ID NO: 17: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 488 Amlnosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKOLS: Protein 

(vi) URSPRUNLICHE HERKUNFT: 

(A) ORGANISMUS: Archaeoglobus fulgidus 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 17: 

Met Val He Lys Asn He Asp Ala Ala Thr Val Ala Lys Lys Phe Leu 
1 5 10 15 

Val Arg Gly Tyr Asn He Asp Pro Lys Ala Ala Glu Leu He Cys Lys 
20 25 30 

Ser Gly Leu Phe Ser Asp Glu Leu Val Asp Lys lie Cys Arg lie Ala 
35 40 45 

Asn Gly Gly Phe lie lie Glu Lys Ser Val Val Glu Glu Phe Leu Arg 
50 55 60 

Asn Leu Ser Asn Leu Lys Pro Ala Thr Leu Thr Pro Arg Pro Glu Glu 
65 70 75 80 

Arg Lys Val Glu Glu Val Lys Ala Ser Cys lie Ala Leu Lys Val lie 
85 90 95 

Lys Asp lie Thr Gly Lys Ser Ser Cys Gin Gly Asn Val Glu. Asp Phe 
100 105 110 

Leu Met Tyr Phe Asn Ser Arg Leu Glu Lys Leu Ser Arg lie He Arg 
115 120 125 
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Set Arg Val Asn Thr Thr Pro lie Ala His Ala Gly Lys Val Arg Gly 
130 135 140 

Asn Val Ser Val Val Gly Met Val Asn Glu Val Tyr Glu Arg Gly Asp 
145 150 155 160 

Lys Cys Tyr lie Arg Leu Glu Asp Thr Thr Gly Thr lie Thr Cys Val 
165 170 175 

Ala Thr Gly Lys Asn Ala Glu Val Ala Arg Glu Leu Leu Gly Asp Glu 
180 185 190 

Val lie Gly Val Thr Gly Leu Leu Lys Gly Ser Ser Leu Tyr Ala Asn 
1 95 200 205 

Arg lie Val Phe Pro Asp Val Pro lie Asn Gly Asn Gly Glu Lys Lys 
210 215 220 

Arg Asp Phe Tyr lie Val Phe Leu Ser Asp Thr His Phe Gly Ser Lys 
225 230 235 240 

Glu Phe Leu Glu Lys Glu Trp Glu Met Phe Val Arg Trp Leu Lys Gly 
245 250 255 

Glu Val Gly Gly Lys Lys Ser Gin Asn Leu Ala Glu Lys Val Lys Tyr 
260 265 270 

lie Val lie Ala Gly Asp lie Val Asp Gly lie Gly Val Tyr Pro Gly 
275 280 285 

Gin Glu Asp Asp Leu Ala lie Ser Asp lie Tyr Gly Gin Tyr Glu Phe 
290 295 300 

Ala Ala Ser His Leu Asp Glu lie Pro Lys Glu lie Lys lie lie Val 
305 310 315 320 

Ser Pro Gly Asn His Asp Ala Val Arg Gin Ala Glu Pro Gin Pro Ala 
325 330 335 

Phe Glu Gly Glu He Arg Ser Leu Phe Pro Lys Asn Val Glu His Val 
340 345 350 

Gly Asn Pro Ala Tyr Val Asp lie Glu Gly Val Lys Val Leu lie Tyr 
355 360 365 

His Gly Arg Ser lie Asp Asp lie lie Ser Lys lie Pro Arg Leu Ser 
370 375 380 
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Tyr Asp Glu Pro Gin Lys Val Met Glu Glu Leu Leu Lys Arg Arg His 
385 390 395 400 

Leu Ser Pro lie Tyr Gly Gly Arg Thr Pro Leu Ala Pro Glu Arg Glu 
405 410 415 

Asp Tyr Leu Val He Glu Asp Val Pro Asp lie Leu His Cys Gly His 
420 425 430 

lie His Thr Tyr Gly Thr Gly Phe Tyr Arg Gly Val Phe Met Val Asn 
435 440 445 

Ser Ser Thr Trp Gin Ala Gin Thr Glu Phe Gin Lys Lys Val Asn Leu 
450 455 460 

Asn Pro Met Pro Gly Asn Val Ala Val Tyr Arg Pro Gly Gly Glu Val 
465 470 475 480 

lie Arg Leu Arg Phe Tyr Gly Glu 

485 K 

(2) ANGABEN ZU SEQ ID NO: 18: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 594 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(li) ART DES MOLEKOLS: Protein 

Ivi) URSPRUNLICHE HERKUNFT: 

lA) ORGANISMUS: Methanococcus jannaschii 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 18: 

Met Glu lie lie Asn Lys Phe Leu Asp Leu Glu Ala Leu Leu Ser Pro 
15 10 15 

Thr Val Tyr Glu Lys Leu Lys Asn Phe Asp Glu Glu Lys Leu Lys Arg 
20 25 30 

Leu lie Gin Lys lie Arg Glu Phe Lys Lys Tyr Asn Asn Ala Phe lie 
35 40 45 
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Leu Leu Asp Glu Lys Phe Leu Asp lie Phe Leu Gin Lys Asp Leu Asp 
50 55 60 



Glu lie lie Asn Glu Tyr Lys Asp Phe Asp Phe lie Phe Tyr Tyr Thr 
65 70 75 80 

Gly Glu Glu Glu Lys Glu Lys Pro Lys Glu Val Lys Lys Glu He Lys 
85 90 95 

Lys Glu Thr Glu Glu Lys lie Glu Lys Glu Lys lie Glu Phe Val Lys 
100 105 110 

Lys Glu Glu Lys Glu Gin Phe lie Lys Lys Ser Asp Glu Asp Val Glu 
115 120 125 

Glu Lys Leu Lys Gin Leu He Ser Lys Glu Glu Lys Lys Glu Asp Phe 
130 135 140 

Asp Ala Glu Arg Ala Lys Arg Tyr Glu His lie Thr Lys lie Ly^ Glu 
145 150 155 160 

Ser Val Asn Ser Arg He Lys Trp lie Ala Lys Asp lie Asp Ala Val 
165 170 175 

lie Glu lie Tyr Glu Asp Ser Asp Val Ser Gly Lys Ser Thr Cys Thr 
180 185 190 

Gly Thr He Glu Asp Phe Val Lys Tyr Phe Arg Asp Arg Phe Glu Arg 
195 200 205 

Leu Lys Val Phe lie Glu Arg Lys Ala Gin Arg Lys Gly Tyr Pro Leu 
210 215 220 

Lys Asp lie Lys Lys Met Lys Gly Gin Lys Asp lie Phe Val Val Gly 
225 230 235 240 

lie Val Ser Asp Val Asp Ser Thr Arg Asn Gly Asn Leu He Val Arg 
245 250 255 

He Glu Asp Thr Glu Asp Glu Ala Thr Leu He Leu Pro Lys Glu Lys 
260 265 270 

He Glu Ala Gly Lys He Pro Asp Asp He Leu Leu Asp Glu Val lie 
275 280 285 

Gly Ala He Gly Thr Val Ser Lys Ser Gly Ser Ser He Tyr Val Asp 
290 295 300 



57 



DE 198 40 771 Al 

Glu lie lie Arg Pro Ala Leu Pro Pro Lys Glu Pro Lys Arg lie Asp 
305 310 315 320 

Glu Glu lie Tyr Met Ala Phe Leu Ser Asp lie His Val Gly Ser Lys 
325 330 335 



10 



Glu Phe Leu His Lys Glu Phe Glu Lys Phe lie Arg Phe Leu Asn Gly 
340 345 350 



15 



Asp Val Asp Asn Glu Leu Glu Glu Lys Val Val Ser Arg L^u Lys Tyr 
355 360 365 



20 



lie Cys lie Ala Gly Asp Leu Val Asp Gly Val Gly Val Tyr Pro Gly 
370 375 380 

Gin Glu Glu Asp Leu Tyr Glu Val Asp lie lie Glu GIn Tyr Arg Glu 
385 390 395 400 



25 



lie Ala Met Tyr Leu Asp Gin He Pro Glu His lie Ser He lie llq 
405 410 415 



30 



Ser Pro Gly Asn His Asp Ala Val Arg Pro Ala Glu Pro Gin Pro Lys 
420 425 430 



35 



Leu Pro Glu Lys lie Thr Lys Leu Phe Asn Arg Asp Asn lie Tyr Phe 
435 440 445 



Val Gly Asn Pro Cys Thr Leu Asn lie His Gly Phe Asp Thr Leu Leu 
450 455 460 

40 

Tyr His Gly Arg Ser Phe Asp Asp Leu Val Gly Gin lie Arg Ala Ala 
465 470 475 480 

45 Ser Tyr Glu Asn Pro Val Thr He Met Lys Glu Leu lie Lys Arg Arg 
485 490 495 



so 



Leu Leu Cys Pro Thr Tyr Gly Gly Arg Cys Pro lie Ala Pro Glu His 
500 505 510 
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Lys Asp Tyr Leu Val He Asp Arg Asp lie Asp lie Leu His Thr Gly 
515 520 525 

His lie His lie Asn Gly Tyr Gly lie Tyr Arg Gly Val Val Met Val 
530 535 540 

Asn Ser Gly Thr Phe Gin Glu Gin Thr Asp Phe Gin Lys Arg Met Gly 
545 550 555 560 



65 
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He Ser Pro Thr Pro Ala lie Val Pro lie lie Asn Met Ala Lys Val 
565 570 575 

Gly Glu Lys Gly His Tyr Leu Glu Trp Asp Arg Gly Val Leu Glu Val 
580 585 590 

Arg Tyr 



(2) ANGABEN ZU SEQ ID NO: 19: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 622 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 

(vi) URSPRUNUCHE HERKUNIFT: 

(A) ORGANISMUS: Pyrococcus horikoshii 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 19: 

Met Asp Glu Phe Val Lys Gly Leu Met Lys Asn Gly Tyr Leu lie Thr 
15 10 15 

Pro Ser Ala Tyr Tyr Leu Leu Val Gly His Phe Asn Glu Gly Lys Phe 
20 25 30 

Ser Leu lie Glu Leu lie Lys Phe Ala Lys Ser Arg Glu Thr Phe lie 
35 40 45 

He Asp Asp Glu lie Ala Asn Glu Phe Leu Lys Ser lie Gly Ala Glu 
50 55 60 

Val Glu Leu Pro Gin Glu lie Lys Glu Gly Tyr lie Ser Thr Gly Glu 
65 70 75 80 

Gly Ser Gin Lys Val Pro Asp His Glu Glu Leu Glu Lys lie Thr Asn ^ 
85 90 95 

Glu Ser Ser Val Glu Ser Ser lie Ser Thr Gly Glu Thr Pro Lys Thr 
100 105 110 
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Glu Glu Leu Gin Pro Thr Leu Asp lie Leu Glu Glu Glu lie Gly Asp 
115 120 125 

^ lie Glu Gly Gly Glu Ser Ser He Ser Thr Gly Asp Glu Val Pro Glu 
130 135 140 

Val Glu Asn Asn Asn Gly Gly Thr Val Val Val Phe Asp Lys Tyr Gly 
145 150 155 160 

Tyr Pro Phe Thr Tyr Val Pro Glu Glu lie Glu Glu Glu Leu Glu Glu 
15 165 170 175 

Tyr Pro Lys Tyr Glu Asp Val Thr lie Glu lie Asn Pro Asn Leu Glu 
1 80 1 85 1 90 

20 

Val Val Pro lie Glu Lys Asp Tyr Glu lie Lys Phe Asp Val Arg Arg 
1 95 200 205 

Val Lys Leu Lys Pro Pro Lys Val Lys Ser Gly Ser Gly Lys GJu Gly 
210 215 220 

^ Glu lie lie Val Glu Ala Tyr Ala Ser Leu Phe Arg Ser Arg Leu Arg 
225 230 235 240 

Lys Leu Arg Arg lie Leu Arg Glu Asn Pro Glu Val Ser Asn Val lie 
35 245 250 255 

Asp lie Lys Lys Leu Lys Tyr Val Lys Gly Asp Glu Glu Val Thr lie 
260 265 270 

40 

lie Gly Leu Val Asn Ser Lys Lys Glu Thr Ser Lys Gly Leu lie Phe 
275 280 285 

45 Glu Val Glu Asp Gin Thr Asp Arg Val Lys Val Phe Leu Pro Lys Asp 
290 295 300 

Ser Glu Asp Tyr Arg Glu Ala Leu Lys Val Leu Pro Asp Ala Val Val 
305 310 315 320 

Ala Phe Lys Gly Val Tyr Ser Lys Arg Gly lie Phe Phe Ala Asn Arg 
55 325 330 335 

Phe Tyr Leu Pro Asp Val Pro Leu Tyr Arg Lys Gin Lys Pro Pro Leu 
340 345 350 

60 

Glu Glu Lys Val Tyr Ala Val Leu Thr Ser Asp He His Val Gly Ser 
355 360 365 
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Lys Glu Phe Cys Glu Lys Ala Phe lie Lys Phe Leu Glu Trp Leu Asn 
370 375 380 

Gly Tyr Val Glu Ser Lys Glu Glu Glu Glu lie Val Ser Arg lie Arg ^ 
385 390 395 400 

Tyr Leu He Me Ala Gly Asp Val Val Asp Gly lie Gly lie Tyr Pro 
405 410 415 

Gly Gin Tyr Ser Asp Leu He lie Pro Asp lie Phe Asp Gin Tyr Glu 

420 425 430 ^3 

Ala Leu Ala Asn Leu Leu Ser Asn Val Pro Lys His lie Thr lie Phe 
435 440 445 

20 

lie Gly Pro Gly Asn His Asp Ala Ala Arg Pro Ala lie Pro Gin Pro 
450 455 460 

Glu Phe Tyr Glu Glu Tyr Ala Lys Pro Leu Tyr Lys Leu Lys ^sn Thr 
465 470 475 480 

Val lie lie Ser Asn Pro Ala Val lie Arg Leu His Gly Arg Asp Phe 

485 490 495 ^ 

Leu lie Ala His Gly Arg Gly lie Glu Asp Val Val Ser Phe Val Pro 
500 505 510 

35 

Gly Leu Thr His His Lys Pro Gly Leu Pro Met Val Glu Leu Leu Lys 
515 520 525 

40 

Met Arg His Leu Ala Pro Thr Phe Gly Gly Lys Val Pro lie Ala Pro 
530 535 540 

Asp Pro Glu Asp Leu Leu Val lie Glu Glu Val Pro Asp Leu Val Gin 45 
545 550 555 560 

Met Gly His Val His Val Tyr Asp Thr Ala Val Tyr Arg Gly Val Gin 

565 570 575 so 

Leu Val Asn Ser Ala Thr Trp Gin Ala Gin Thr Glu Phe Gin Lys Met 
580 585 590 

55 

Val Asn lie Val Pro Thr Pro Gly Leu Val Pro lie Val Asp Val Glu 
595 600 605 

Ser Ala Arg Val lie Lys Val Leu Asp Phe Ser Arg Trp Cys 
610 615 620 



65 
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(2) ANGABEN ZU SEQ ID NO: 20: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 482 Aminosauren 
IB) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 

(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Methanobacterium thermoautotrophicum 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 20: 



Met Asn Glu lie lie Gly Lys Phe Ala Arg Glu Gly lie LeuHle Glu 

1 5 10 15 ^; 

Asp Asn Ala Tyr Phe Arg Leu Arg Glu Met Asp Asp Pro Ala Ser Val 
20 25 30 

Ser Ser Glu Leu lie Val Lys lie Lys Ser Asn Gly Gly Lys Phe Thr 
35 40 45 

Val Leu Thr Ser Glu Met Leu Asp Glu Phe Phe Glu lie Asp Asn Pro 
50 55 60 

Ala Glu lie Lys Ala Arg Gly Pro Leu Met Val Pro Ala Glu Arg Asp 
65 70 75 80 

Phe Asp Phe Glu Val lie Ser Asp Thr Ser Asn Arg Ser Tyr Thr Ser 
85 90 95 

Gly Glu lie Gly Asp Met lie Ala Tyr Phe Asn Ser Arg Tyr Ser Ser 
100 105 110 

Leu Lys Asn Leu Leu Ser Lys Arg Pro Glu Leu Lys Gly His lie Pro 
115 120 125 

He Ala Asp Leu Arg Gly Gly Glu Asp Val Val Ser lie lie Gly Met 
130 135 140 

Val Asn Asp Val Arg Asn Thr Lys Asn Asn His Arg lie lie Glu Leu 
145 150 155 160 
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Glu Asp Asp Thr Gly Glu lie Ser Val Val Val His Asn Glu Asn His 
165 170 175 

Lys Leu Phe Glu Lys Ser Glu Lys lie Val Arg Asp Glu Val Val Gly 
1 80 1 85 1 90 

Val His Gly Thr Lys Lys Gly Arg Phe Val Val Ala Ser Glu lie Phe 
1 95 200 205 

His Pro Gly Val Pro Arg lie Gin Glu Lys Glu Met Asp Phe Ser Val 
210 215 220 

Ala Phe lie Ser Asp Val His He Gly Ser Gin Thr Phe Leu Glu Asp 
225 230 235 240 

Ala Phe Met Lys Phe Val Lys Trp lie Asn Gly Asp Phe Gly Ser Glu 
245 250 255 

Glu Gin Arg Ser Leu Ala Ala Asp Val Lys Tyr Leu Val Val Ala Gly 
260 265 270 

Asp lie Val Asp Gly lie Gly lie Tyr Pro Gly Gin Glu Lys Glu Leu 
275 280 285 

Leu lie Arg Asp lie His Glu Gin Tyr Glu Glu Ala Ala Arg Leu Phe 
290 295 300 

Gly Asp lie. Arg Ser Asp lie Lys lie Val Met lie Pro Gly Asn His 
305 310 315 320 

Asp Ser Ser Arg lie Ala Glu Pro Gin Pro Ala lie Pro Glu Glu Tyr 
325 330 335 

Ala Lys Ser Leu Tyr Ser lie Arg Asn lie Glu Phe Leu Ser Asn Pro 
340 345 350 

Ser Leu Val Ser Leu Asp Gly Val Arg Thr Leu lie Tyr His Gly Arg 
355 360 365 

Ser Phe Asp Asp Met Ala Met Ser Val Asn Gly Leu Ser His Glu Arg 
370 375 380 

Ser Asp Leu lie Met Glu Glu Leu Leu Glu Lys Arg His Leu Ala Pro 
385 390 395 400 

lie Tyr Gly Glu Arg Thr Pro Leu Ala Ser Glu lie Glu Asp His Leu 
405 410 415 
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Val lie Asp Glu Val Pro His Val Leu His Thr Gly His Val His lie 
420 425 430 

Asn Ala Tyr Lys Lys Tyr Lys Gly Val His Leu lie Asn Ser Gly Thr 
435 440 445 

Phe Gin Ser Giri Thr Glu Phe Gin Lys lie Tyr Asn He Val Pro Thr 
450 455 460 

Cys Gly Gin Val Pro Val Leu Asn Arg Gly Val Met Lys Leu Leu Glu 
465 470 475 480 

Phe Ser 



(2) ANGABEIM ZU SEQ ID NO: 21: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 613 An^inosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 
{D) TOPOLOGIE: linear 

(ii) ART DBS MOLEKULS: Protein 

(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Pyrococcus furiosus 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 21: 

Met Asp Glu Phe Val Lys Ser Leu Leu Lys Ala Asn Tyr Leu lie Thr 
1 5 10 15 

Pro Ser Ala Tyr Tyr Leu Leu Arg Glu Tyr Tyr Glu Lys Gly Glu Phe 
20 25 30 

Ser He Val Glu Leu Val Lys Phe Ala Arg Ser Arg Glu Ser Tyr lie 
35 40 45 

lie Thr Asp Ala Leu Ala Thr Glu Phe Leu Lys Val Lys Gly Leu Glu 
50 55 60 

Pro He Leu Pro Val Glu Thr Lys Gly Gly Phe Val Ser Thr Gly Glu 
65 70 75 80 
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Ser Gin Lys Glu Gin Ser Tyr Glu Glu Ser Phe Gly Thr Lys Glu Glu 
85 90 95 

lie Ser Gin Glu lie Lys Glu Gly Glu Ser Phe lie Ser Thr Gly Ser ^ 

100 105 no 

Glu Pro Leu Glu Glu Glu Leu Asn Ser lie Gly lie Glu Glu He Gly 
115 120 125 

Ala Asn Glu Glu Leu Val Ser Asn Gly Asn Asp Asn Gly Gly Glu Ala 

130 135 140 15 

lie Val Phe Asp Lys Tyr Gly Tyr Pro Met Val Tyr Ala Pro Glu Glu 
145 150 155 160 

20 

lie Glu Val Glu Glu Lys Glu Tyr Ser Lys Tyr Glu Asp Leu Thr lie 
165 170 175 

Pro Met Asn Pro Asp Phe Asn Tyr Val Glu lie Lys Glu Asp^Tyr Asp 25 
180 185 190 

Val Val Phe Asp Val Arg Asn Vai Lys Leu Lys Pro Pro Lys Val Lys 
195 200 205 

Asn Gly Asn Gly Lys Glu Gly Glu lie lie Val Glu Ala Tyr Ala Ser 
210 215 220 

35 

Leu Phe Arg Ser Arg Leu Lys Lys Leu Arg Lys lie Leu Arg Glu Asn 
225 230 235 240 

40 

Pro Glu Leu Asp Asn Val Val Asp lie Gly Lys Leu Lys Tyr Val Lys 
245 250 255 

Glu Asp Glu Thr Val Thr lie lie Gly Leu Val Asn Ser Lys Arg Glu 4s 
260 265 270 

Val Asn Lys Gly Leu lie Phe Glu lie Glu Asp Leu Thr Gly Lys Val 

275 280 285 so 

Lys Val Phe Leu Pro Lys Asp Ser Glu Asp Tyr Arg Glu Ala Phe Lys 
290 295 300 

Val Leu Pro Asp Ala Val Val Ala Phe Lys Gly Val Tyr Ser Lys Arg 
305 310 315 320 

Gly lie Leu Tyr Ala Asn Lys Phe Tyr Leu Pro Asp Val Pro Leu Tyr 
325 330 335 
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Arg Arg Gin Lys Pro Pro Leu Glu Glu Lys Val Tyr Ala lie Leu lie 
340 345 350 

^ Ser Asp lie His Val Gly Ser Lys Glu Phe Cys Glu Asn Ala Phe lie 
355 360 365 

,0 Lys Phe Leu Glu Trp Leu Asn Gly Asn Val Glu Thr Lys Glu Glu Glu 
370 375 380 

Glu lie Val Ser Arg Val Lys Tyr Leu lie lie Ala Gly Asp Val Val 
IS 385 390 395 400 

Asp Gly Val Gly Val Tyr Pro Gly Gin Tyr Ala Asp Leu Thr lie Pro 
405 410 415 

20 

Asp lie Phe Asp Gin Tyr Glu Ala Leu Ala Asn Leu Leu Ser His Val 
420 425 430 

Pro Lys His lie Thr Met Phe lie Ala Pro Gly Asn His Asp Alp Ala 
435 440 445 

^ Arg Gin Ala lie Pro Gin Pro Glu Phe Tyr Lys Glu Tyr Ala Lys Pro 
450 455 460 

He Tyr Lys Leu Lys Asn Ala Val lie lie Ser Asn Pro Ala Val lie 
35 465 470 475 480 

Arg Leu His Gly Arg Asp Phe Leu lie Ala His Gly Arg Gly lie Glu 
485 490 495 

40 

Asp Val Val Gly Ser Val Pro Gly Leu Thr His His Lys Pro Gly Leu 
500 505 510 

45 Pro Met Val Glu Leu Leu Lys Met Arg His Val Ala Pro Met Phe Gly 
515 520 525 

Gly Lys Val Pro lie Ala Pro Asp Pro Glu Asp Leu Leu Val lie Glu 
* 530 535 540 

Glu Val Pro Asp Val Val His Met Gly His Val His Val Tyr Asp Ala 
55 545 550 555 560 

Val Val Tyr Arg Gly Val Gin Leu Val Asn Ser Ala Thr Trp Gin Ala 
565 570 575 

60 

Gin Thr Glu Phe Gin Lys Met Val Asn lie Val Pro Thr Pro Ala Lys 
580 585 590 
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Val Pro Val Val Asp lie Asp Thr Ala Lys Val Val Lys Val Leu Asp 
595 600 605 

Phe Ser Gly Trp Cys 
610 

(2) ANGABEN ZU SEQ ID NO: 22: 

(I) SEQUENZKENNZEICHEN: 

(A) LANGE: 1 1 07 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 

(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Homo sapiens 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 22: 

Met Asp Gly Lys Arg Arg Pro Gly Pro Gly Pro Gly Val Pro Pro Lys 
15 10 15 

Arg Ala Arg Gly Gly Leu Trp Asp Asp Asp Asp Ala Pro Trp Pro Ser 
20 25 30 

Gin Phe Glu Glu Asp Leu Ala Leu Met Glu Glu Met Glu Ala Giu His 
35 40 45 

Arg Leu Gin Glu Gin Glu Glu Glu Giu Leu Gin Ser Val Leu Glu Gly 
50 55 60 

Val Ala Asp Gly Gin Val Pro Pro Ser Ala lie Asp Pro Arg Trp Leu 
65 70 75 80 

Arg Pro Thr Pro Pro Ala Leu Asp Pro Gin Thr Glu Pro Leu lie Phe 
85 90 95 

Gin Gin Leu Glu lie Asp His Tyr Val Gly Pro Ala Gin Pro Val Pro 
100 105 110 

Gly Gly Pro Pro Pro Ser Arg Gly Ser Val Pro Val Leu Arg Ala Phe 
115 120 125 
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Gly Val Thr Asp Glu Gly Phe Ser Val Cys Cys His lie His Gly Phe 
130 135 140 

Ala Pro Tyr Phe Tyr Thr Pro Ala Pro Pro Gly Phe Gly Pro Glu His 
145 150 155 160 

,0 Met Gly Asp Leu Gin Arg Glu Leu Asn Leu Ala lie Ser Arg Asp Ser 
165 170 175 

Arg Gly Gly Arg Glu Leu Thr Gly Pro Ala Val Leu Ala Val Glu Leu 
15 180 185 190 

Cys Ser Arg Glu Ser Met Phe Gly Tyr His Gly His Gly Pro Ser Pro 
1 95 200 205 

20 

Phe Leu Arg lie Thr Val Ala Leu Pro Arg Leu Val Ala Pro Ala Arg 
210 215 220 

^ Arg Leu Leu Glu Gin Gly lie Arg Val Ala Gly Leu Gly Thr Ptp Ser 
225 230 235 240 

^ Phe Ala Pro Tyr Glu Ala Asn Val Asp Phe Glu lie Arg Phe Met Val 
245 250 255 

Asp Thr Asp lie Val Gly Cys Asn Trp Leu Glu Leu Pro Ala Gly Lys 
35 260 265 270 

Tyr Ala Leu Arg Leu Lys Glu Lys Ala Thr Gin Cys Gin Leu Glu Ala 
275 280 285 ' 

40 

Asp Val Leu Trp Ser Asp Val Val Ser His Pro Pro Glu Gly Pro Trp 
290 295 300 

45 Gin Arg lie Ala Pro Leu Arg Val Leu Ser Phe Asp lie Glu Cys Ala 
305 310 315 320 

Gly Arg Lys Gly lie Phe Pro Glu Pro Glu Arg Asp Pro Val lie Gin 

^ 325 330 335 

lie Cys Ser Leu Gly Leu Arg Trp Gly Glu Pro Glu Pro Phe Leu Arg 
5j 340 345 350 

Leu Ala Leu Thr Leu Arg Pro Cys Ala Pro lie Leu Gly Ala Lys Val 
355 360 365 

60 

Gin Ser Tyr Glu Lys Glu Glu Asp Leu Leu Gin Ala Trp Ser Thr Phe 
370 375 380 
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lie Arg lie Met Asp Pro Asp. Val lie Thr Gly Tyr Asn lie Gin Asn 
385 390 395 400 

Phe Asp Leu Pro Tyr Leu lie Ser Arg Ala Gin Thr Leu Lys Val Gin * 
405 410 415 

Thr Phe Pro Phe Leu Gly Arg Val Ala Gly Leu Cys Ser Asn lie Arg 
420 425 430 

Asp Ser Ser Phe Gin Ser Lys Gin Thr Gly Arg Arg Asp Thr Lys Val 

435 440 445 is 

Val Ser Met Val Gly Arg Val Gin Met Asp Met Leu Gin Val Leu Leu 
450 455 460 

20 

Arg Glu Tyr Lys Leu Arg Ser His Thr Leu Asn Ala Val Ser Phe His 
465 470 475 480 

■V. 

Phe Leu Gly Glu Gin Lys Glu Asp Val Gin His Ser lie lie Thr Asp 
485 490 495 

Leu Gin Asn Gly Asn Asp Gin Thr Arg Arg Arg Leu Ala Val Tyr Cys 

500 505 510 ^ 

Leu Lys Asp Ala Tyr Leu Pro Leu Arg Leu Leu Glu Arg Leu Met Val 
515 520 525 

Leu Val Asn Ala Val Glu Met Ala Arg Val Thr Gly Val Pro Leu Ser 
530 535 540 

40 

Tyr Leu Leu Ser Arg Gly Gin Gin Val Lys Val Val Ser Gin Leu Leu 
545 550 555 560 

Arg Gin Ala Met His Glu Gly Leu Leu Met Pro Val Val Lys Ser Glu 45 
565 570 575 

Gly Gly Glu Asp Tyr Thr Gly Ala Thr Val lie Glu Pro Leu Lys Gly 

580 585 590 50 

Tyr Tyr Asp VaJ Pro lie Ala Thr Leu Asp Phe Ser Ser Leu Tyr Pro 
595 600 605 

Ser lie Met Met Ala His Asn Leu Cys Tyr Thr Thr Leu Leu Arg Pro 
610 615 620 

Gly Thr Ala Gin Lys Leu Gly Leu Thr Glu Asp Gin Phe lie Arg Thr 
625 630 635 640 
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Pro Thr Gly Asp Glu Phe Val Lys Thr Ser Val Arg Lys Gly Leu Leu 
645 650 655 

Pro Gin lie Leu Glu Asn Leu Leu Ser Ala Arg Lys Arg Ala Lys Ala 
660 , 665 670 



10 



Glu Leu Ala Lys Glu Thr Asp Pro Leu Arg Arg Gin Val Leu Asp Gly 
675 680 685 



Arg Gin Leu Ala Leu Lys Val Ser Ala Asn Ser Val Tyr Gly Phe Thr 
IS 690 695 700 

Gly Ala Gin Val Gly Lys Leu Pro Cys Leu Glu lie Ser Gin Ser Val 
705 710 715 720 



20 



Thr Gly Phe Gly Arg Gin Met lie Glu Lys Thr Lys Gin Leu Val Glu 
725 730 735 



25 



Ser Lys Tyr Thr Val Glu Asn Gly Tyr Ser Thr Ser Ala Lys Val Val 
740 745 750 



30 Tyr Gly Asp Thr Asp Ser Val Met Cys Arg Phe Gly Val Ser Ser Val 
755 760 765 



35 



Ala Glu Ala Met Ala Leu Gly Arg Glu Ala Ala Asp Trp Val Ser Gly 
770 775 780 



His Phe Pro Ser Pro lie Arg Leu Glu Phe Glu Lys Val Tyr Phe Pro 
785 790 795 800 

40 

Tyr Leu Leu lie Ser Lys Lys Arg Tyr Ala Gly Leu Leu Phe Ser Ser 
805 810 815 

45 Arg Pro Asp Ala His Asp Arg Met Asp Cys Lys Gly Leu Glu Ala Val 
820 825 830 

Arg Arg Asp Asn Cys Pro Leu Val Ala Asn Leu Val Thr Ala Ser Leu 
835 840 845 



55 



AO 



Arg Arg Leu Leu lie Asp Arg Asp Pro Glu Gly Ala Val Ala His Ala 
850 855 860 

Gin Asp Val lie Ser Asp Leu Leu Cys Asn Arg lie Asp He Ser Gin 
865 870 875 880 

Leu Val lie Thr Lys Glu Leu Thr Arg Ala Ala Ser Asp Tyr Ala Gly 
885 890 895 
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Lys Gin Ala His Val GIu Leu Ala Glu Arg Met Arg Lys Arg Asp Pro 
900 ' 905 910 

Gly Ser Ala Pro Ser Leu Gly Asp Arg Val Pro Tyr Val lie lie Ser 
915 920 925 

Ala Ala Lys Gly Val Ala Ala Tyr Met Lys Ser Glu Asp Pro Leu Phe 
930 935 940 

Val Leu Glu His Ser Leu Pro lie Asp Thr Gin Tyr Tyr Leu Glu Gin 
945 950 955 960 

Gin Leu Ala Lys Pro Leu Leu Arg lie Phe Glu Pro lie Leu Gly Glu 
965 970 975 

Gly Arg Ala Glu Ala Val Leu Leu Arg Gly Asp His Thr Arg Cys Lys 
980 985 990 

Thr Val Leu Thr Gly Lys Val Gly Gly Leu Leu Ala Phe Ala Lys Arg 
995 1000 1005 

Arg Asn Cys Cys lie Gly Cys Arg Thr Val Leu Ser His Gin Gly Ala 
1010 1015 1020 

Val Cys Glu Phe Cys Gin Pro Arg Glu Ser Glu Leu Tyr Gin Lys Glu 
1025 1030 1035 1040 

Val Ser His Leu Asn Ala Leu Glu Glu Arg Phe Ser Arg Leu Trp Thr 
1045 1050 1055 

Gin Cys Gin Arg Cys Gin Gly Ser Leu His Glu Asp Val lie Cys Thr 
1060 1065 1070 

Ser Arg Asp Cys Pro lie Phe Tyr Met Arg Lys Lys Val Arg Lys Asp 
1075 1080 1085 

Leu Glu Asp Gin Glu Gin Leu Leu Arg Arg Phe Gly Pro Pro Gly Pro 
1090 1095 1100 

Glu Ala Trp 
1105 

(2) ANGABEN ZU SEQ ID NO: 23: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 781 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 
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(D) TOPOLOGIE: linear 

^ (ii) ART DES MOLEKULS: Protein 

(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Archaeoglobus fulgidus 

(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 23: 

15 

iVIet Glu Arg Val Glu Gly Trp Leu lie Asp Ala Asp Tyr Glu Thr lie 
15 10 15 

20 Gly Gly Lys Ala Val Val Arg Leu Trp Cys Lys Asp Asp Gin Gly He 
20 25 30 

Phe Val Ala Tyr Asp Tyr Asn Phe Asp Pro Tyr Phe Tyr Val +le Gly 
35 40 45 • 

Val Asp Glu Asp lie Leu Lys Asn Ala Ala Thr Ser Thr Arg Arg Glu 
30 50 55 60 

Val lie Lys Leu Lys Ser Phe Glu Lys Ala Gin Leu Lys Thr Leu Gly 
65 70 75 80 

Arg Glu Val Glu Gly Tyr He Val Tyr Ala His His Pro Gin His Val 
85 90 95 

40 Pro Lys Leu Arg Asp Tyr Leu Ser Gin Phe Gly Asp Val Arg Glu Ala 
100 105 110 

Asp lie Pro Phe Ala Tyr Arg Tyr Leu lie Asp Lys Asp Leu Ala Cys 
45 115 120 125 

Met Asp Gly lie Ala He Glu Gly Glu Lys Gin Gly Gly Val lie Arg 
130 135 140 

50 

Ser Tyr Lys lie Glu Lys Val Glu Arg lie Pro Arg Met Glu Phe Pro 
145 150 155 160 

Glu Leu Lys Met Leu Val Phe Asp Cys Glu Met Leu Ser Ser Phe Gly 
165 170 175 

^ Met Pro Glu Pro Glu Lys Asp Pro He He Val lie Ser Val Lys Thr 
180 185 190 
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Asn Asp Asp Asp Glu lie lie Leu Thr Gly Asp Glu Arg Lys lie lie 
195 200 205 

Ser Asp Phe Val Lys Leu lie Lys Ser Tyr Asp Pro Asp lie lie Val 
210 215 220 

Gly Tyr Asn Gin Asp Ala Phe Asp Trp Pro Tyr Leu Arg Lys Arg Ala 
225 230 235 240 

Glu Arg Trp Asn He Pro Leu Asp Val Gly Arg Asp Gly Ser Asn Val 
245 250 .255 

Val Phe Arg Gly Gly Arg Pro Lys lie Thr Gly Arg Leu Asn Val Asp 
260 265 270 

Leu Tyr Asp lie Ala Met Arg lie Ser Asp lie Lys lie Lys Lys Leu 
275 280 285 

Glu Asn Val Ala Glu Phe Leu Gly Thr Lys lie Glu lie Ala Asp lie 
290 295 300 

Glu Ala Lys Asp lie Tyr Arg Tyr Trp Ser Arg Gly Glu Lys Glu Lys 
305 310 315 320 

Val Leu Asn Tyr Ala Arg Gin Asp Ala lie Asn Thr Tyr Leu lie Ala 
325 330 335 

Lys Glu Leu Leu Pro Met His Tyr Glu Leu Ser Lys Met lie Arg Leu 
340 345 350 

Pro Val Asp Asp Val Thr Arg Met Gly Arg Gly Lys Gin Val Asp Trp 
355 360 365 

Leu Leu Leu Ser Glu Ala Lys Lys lie Gly Glu lie Ala Pro Asn Pro 
370 375 380 

Pro Glu His Ala Glu Ser Tyr. Glu Gly Ala Phe Val Leu Glu Pro Glu 
385 390 395 400 

Arg Gly Leu His Glu Asn Val Ala Cys Leu Asp Phe Ala Ser Met Tyr 
405 410 415 

Pro Ser He Met lie Ala Phe Asn lie Ser Pro Asp Thr Tyr Gly Cys 
420 425 430 

Arg Asp Asp Cys Tyr Glu Ala Pro Glu Val Gly His Lys Phe Arg Lys 
435 440 445 
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Ser Pro Asp Gly Phe Phe Lys Arg lie Leu Arg Met Leu lie Glu Lys 
450 455 460 

Arg Arg Glu Leu Lys Val Glu Leu Lys Asn Leu Ser Pro Glu Ser Ser 
465 470 475 480 

,„ Glu Tyr Lys Leu Leu Asp lie Lys Gin Gin Thr Leu Lys Val Leu Thr 
485 490 495 

Asn Ser Phe Tyr Gly Tyr Met Gly Trp Asn Leu Ala Arg Trp Tyr Cys 
15 500 505 510 

His Pro Cys Ala Glu Ala Thr Thr Ala Trp Gly Arg His Phe lie Arg 
515 520 525 

20 

Thr Ser Ala Lys lie Ala Glu Ser Met Gly Phe Lys Val Leu Tyr Gly 
530 535 540 

Asp Thr Asp Ser lie Phe Val Thr Lys Ala Gly Met Thr Lys $lu Asp 
545 550 555 560 

^ Val Asp Arg Leu lie Asp Lys Leu His Glu Glu Leu Pro He Gin lie 
565 570 575 

Glu Val Asp Glu Tyr Tyr Ser Ala lie Phe Phe Val Glu Lys Lys Arg 
35 580 585 590 

Tyr Ala Gly Leu Thr Glu Asp Gly Arg Leu Val Val Lys Gly Leu Glu 
595 600 605 

40 

Val Arg Arg Gly Asp Trp Cys Glu Leu Ala Lys Lys Val Gin Arg Glu 
610 615 620 

45 Val lie Glu Val lie Leu Lys Glu Lys Asn Pro Glu Lys Ala Leu Ser 
625 630 635 640 

Leu Val Lys Asp Val lie Leu Arg lie Lys Glu Gly Lys Val Ser Leu 
^ 645 650 655 

Glu Glu Val Val He Tyr Lys Gly Leu Thr Lys Lys Pro Ser Lys Tyr 
5j 660 665 670 

Glu Ser Met Gin Ala His Val Lys Ala Ala Leu Lys Ala Arg Glu Met 
675 680 685 

(SO 

Gly lie Ije Tyr Pro Val Ser Ser Lys lie Gly Tyr Val lie Val Lys 
690 695 700 
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Gly Ser Gly Asn lie Gly Asp Arg Ala Tyr Pro lie Asp Leu lie Glu 
705 710 715 720 

Asp Phe Asp Gly Glu Asn Leu Arg lie Lys Thr Lys Ser Gly lie Glu 
725 730 735 

lie Lys Lys Leu Asp Lys Asp Tyr Tyr He Asp Asn Gin lie He Pro 
740 745 750 

Ser Val Leu Arg lie Leu Glu Arg Phe Gly Tyr Thr Glu Ala Ser Leu 
755 760 765 

Lys Gly Ser Ser Gin Met Ser Leu Asp Ser Phe Phe Ser 
770 775 780 

(2) ANGABEN ZU SEQ ID NO: 24: 

li) SEQUENZKENNZEICHEN: 

(A) LANGE: 1 634 Aminosauren X 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKOLS: Protein 

(vi) URSPRONUCHE HERKUNFT: 

(A) ORGANISMUS: Methanococcus jannaschii 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 24: 

Met Gly Met Ser Met Gly Lys lie Lys lie Asp Ala Leu He Asp Asn 
15 10 15 

Thr Tyr Lys Thr lie Glu Asp Lys Ala Val lie Tyr Leu Tyr Leu lie 
20 25 30 

Asn Ser lie Leu Lys Asp Arg Asp Phe Lys Pro Tyr Phe Tyr Val Glu 
35 40 45 

Leu His Lys Glu Lys Val Glu Asn Glu Asp lie Glu Lys He Lvs Glu 
50 55 60 

Phe Leu Leu Lys Asn Asp Leu Leu Lys Phe Val Glu Asn He Glu Val 
65 70 75 80 
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Val Lys Lys He lie Leu Arg Lys Glu Lys Glu Val lie Lys He lie 
85 90 95 

Ala Thr His Pro Gin Lys Val Pro Lys Leu Arg Lys lie Lys Glu Cys 
100 105 110 



10 



Glu lie Val Lys Glu lie Tyr Glu His Asp lie Pro Phe Ala Lys Arg 
115 120 125 



IS 



Tyr Leu lie Asp Asn Glu lie lie Pro Met Thr Tyr Trp Asp Phe Glu 
130 135 140 



20 



Asn Lys Lys Pro Val Ser lie Glu lie Pro Lys Leu Lys Ser Val Ala 
145 .150 155 160 

Phe Asp Met Glu Val Tyr Asn Arg Asp Thr Glu Pro Asn Pro Glu Arg 
165 170 175 



25 



Asp Pro lie Leu Met Ala Ser Phe Trp Asp Glu Asn Gly Gli|( Lys Val 
180 185 190 



3„ lie Thr Tyr Lys Glu Phe Asn His Pro Asn lie Glu Val Val Lys Asn 
195 200 205 



35 



Glu Lys Glu Leu lie Lys Lys lie lie Glu Thr Leu Lys Glu Tyr Asp 
210 215 220 



Val lie Tyr Thr Tyr Asn Gly Asp Asn Phe Asp Phe Pro Tyr Leu Lys 
225 230 235 240 

40 

Ala Arg Ala Lys lie Tyr Gly lie Asp lie Asn Leu Gly Lys Asp Gly 
245 250 255 

« Glu Glu Leu Lys lie Lys Arg Gly Gly Met Glu Tyr Arg Ser Tyr lie 
260 265 270 



50 



Pro Gly Arg Val His lie Asp Leu Tyr Pro He Ser Arg Arg Leu Leu 
275 280 285 



5S 



AO 



Lys Leu Thr Lys Tyr Thr Leu Glu Asp Val Val Tyr Asn Leu Phe Gly 
290 295 300 

lie Glu Lys Leu Lys lie Pro His Thr Lys lie Val Asp Tyr Trp Ala 
305 310 315 320 

Asn Asn Asp Lys Thr Leu lie Glu Tyr Ser Leu Gin Asp Ala Lys Tyr 
325 330 335 
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Thr Tyr Lys lie Gly Lys Tyr Phe Phe Pro Leu Glu Val Met Phe Ser 
340 345 350 

Arg lie Val Ash Gin Thr Pro Phe Glu lie Thr Arg Met Ser Ser Gly ' 
355 360 365 

Gin Met Val Glu Tyr Leu Leu Met Lys Arg Ala Phe Lys Glu Asn Met 
370 375 380 

lie Val Pro Asn Lys Pro Asp Glu Glu Glu Tyr Arg Arg Arg Val Leu 

385 390 395 400 ,5 

Thr Thr Tyr Glu Gly Gly Tyr Val Lys Glu Pro Glu Lys Gly Met Phe 
405 410 415 

20 

Glu Asp lie lie Ser Met Asp Phe Arg Cys His Pro Lys Gly Thr Lys 
420 425 430 

Val Val Val Lys Gly Lys Gly lie Val Asn lie Glu Asp Val Ly§ Glu 
435 440 445 

Gly Asn Tyr Val Leu Gly lie Asp Gly Trp Gin Lys Val Lys Lys Val 

450 455 460 ^ 

Trp Lys Tyr Glu Tyr Glu Gly Glu Leu lie Asn Val Asn Gly Leu Lys 

465 470 475 480 3- 

Cys Thr Pro Asn His Lys lie Pro Leu Arg Tyr Lys lie Lys His Lys 
485 490 495 

40 

Lys lie Asn Lys Asn Asp Tyr Leu Val Arg Asp lie Tyr Ala Lys Ser 
500 505 510 

Leu Leu Thr Lys Phe Lys Gly Glu Gly Lys Leu lie Leu Cys Lys Asp 45 
515 520 525 

Phe Glu Thr lie Gly Asn Tyr Glu Lys Tyr lie Asn Asp Met Asp Glu 

530 535 540 so 

Asp Phe lie Leu Lys Ser Glu Leu lie Gly He Leu Leu Ala Glu Gly 
545 550 555 560 

... 55 

His Leu Leu Arg Arg Asp lie Glu Tyr Phe Asp Ser Ser Arg Gly Lys 
565 570 575 

Lys Arg lie Ser His Gin Tyr Arg Val Glu lie Thr Val Asn Glu Asp 
580 585 590 
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Glu Lys Asp Phe He Glu Lys lie Lys Tyr lie Phe Lys Lys Leu Phe 
595 600 605 

^ Asn Tyr Glu Leu Tyr Val Arg Arg Lys Lys Gly Thr Lys Ala lie Thr 
610 615 620 

,0 Leu Gly Cys Ala Lys Lys Asp lie Tyr Leu Lys lie Glu Glu lie Leu 
625 630 635 640 



IS 



Lys Asn Lys Glu Lys Tyr Leu Pro Asn Ala lie Leu Arg Gly Phe Phe 
645 650 655 



20 



25 



Glu Gly Asp Gly Tyr Val Asn Thr Val Arg Arg Ala Val Val Val Asn 
660 665 670 

Gin Gly Thr Asn Asn Tyr Asp Lys lie Lys Phe lie Ala Ser Leu Leu 
675 680 685 

Asp Arg Leu Gly lie Lys Tyr Ser Phe Tyr Thr Tyr Ser Tyr Gju Glu 
690 695 700 



30 



Arg Gly Lys Lys Leu Lys Arg Tyr Val lie Glu lie Phe Ser Lys Gly 
705 710 715 720 



35 



Asp Leu lie Lys Phe Ser lie Leu lie Ser Phe lie Ser Arg Arg Lys 
725 730 735 



Asn Asn Leu Leu Asn Glu lie lie Arg Gin Lys Thr Leu Tyr Lys lie 
740 745 750 

40 

Gly Asp Tyr Gly Phe Tyr Asp Leu Asp Asp Val Cys Val Ser Leu Glu 
755 760 765 

45 Ser Tyr Lys Gly Glu Val Tyr Asp Leu Thr Leu Glu Gly Arg Pro Tyr 
770 775 780 



50 



Tyr Phe Ala Asn Gly lie Leu Thr His Asn Ser Leu Tyr Pro Ser lie 
785 790 795 800 



55 



60 



lie lie Ser Tyr Asn lie Ser Pro Asp Thr Leu Asp Cys Glu Cys Cys 
805 810 815 

Lys Asp Val Ser Glu Lys lie Leu Gly His Trp Phe Cys Lys Lys Lys 
820 825 830 

Glu Gly Leu lie Pro Lys Thr Leu Arg Asn Leu lie Glu Arg Arg lie 
835 840 845 
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Asn lie Lys Arg Arg Met Lys Lys Met Ala Glu lie Gly Glu He Asn 
850 855 860 

5 

Glu Glu Tyr Asn Leu Leu Asp Tyr Glu Gin Lys Ser Leu Lys He Leu 
865 870 875 880 

Ala Asn Ser He Leu Pro Asp Glu Tyr Leu Thr lie He Glu Glu Asp lo 
885 890 895 

Gly He Lys Val Val Lys lie Gly Glu Tyr He Asp Asp Leu Met Arg 

900 905 910 15 

Lys His Lys Asp Lys He Lys Phe Ser Gly lie Ser Glu He Leu Glu 
915 920 925 

20 

Thr Lys Asn Leu Lys Thr Phe Ser Phe Asp Lys He Thr Lys Lys Cys 
930 935 940 

Glu He Lys Lys Val Lys Ala Leu lie Arg His Pro Tyr Phe GlytLys ^ 
945 950 955 960 

Ala Tyr Lys He Lys Leu Arg Ser Gly Arg Thr lie Lys Val Thr Arg so 
965 970 975 

Gly His Ser Leu Phe Lys Tyr Glu Asn Gly Lys He Val Glu Val Lys 

980 985 990 35 

Gly Asp Asp Val Arg Phe Gly Asp Leu He Val Val Pro Lys Lys Leu 
995 1000 1005 

40 

Thr Cys Val Asp Lys Glu Val Val He Asn He Pro Lys Arg Leu He 
1010 1015 1020 

Asn Ala Asp Glu Glu Glu He Lys Asp Leu Val He Thr Lys His Lys 
1025 1030 1035 1040 

Asp Lys Ala Phe Phe Val Lys Leu Lys Lys Thr Leu Glu Asp lie Glu 

1045 1050 1055 ^ 

Asn Asn Lys Leu Lys Val He Phe Asp Asp Cys He Leu Tyr Leu Lys 

1060 1065 1070 55 

Glu Leu Gly Leu He Asp Tyr Asn He He Lys Lys He Asn Lys Val 
1075 1080 1085 

60 

Asp He Lys He Leu Asp Glu Glu Lys Phe Lys Ala Tyr Lys Lys Tyr 
1090 1095 1100 

65 
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Phe Asp Thr Val lie Glu His Gly Asn Phe Lys Lys Gly Arg Cys Asn 
1105 1110 1115 1120 

^ lie Gin Tyr lie Lys lie Lys Asp Tyr lie Ala Asn He Pro Asp Lys 
1125 1130 1135 

,u Glu Phe Glu Asp Cys Glu lie Gly Ala Tyr Ser Gly Lys lie Asn Ala 
1140 1145 1150 

Leu Leu Lys Leu Asp Glu Lys Leu Ala Lys Phe Leu Gly Phe Phe Val 
IS 1155 1160 1165 

Thr Arg Gly Arg Leu Lys Lys Gin Lys Leu Lys Gly Glu Thr Val Tyr 
1170 1175 1180 

20 

Glu He Ser Val Tyr Lys Ser Leu Pro Glu Tyr Gin Lys Glu lie Ala 
1185 1190 1195 1200 

^ Glu Thr Phe Lys Glu Val Phe Gly Ala Gly Ser Met Val Lys ^sp Lys 
1205 1210 1215 

30 Val Thr Met Asp Asn Lys lie Val Tyr Leu Val Leu Lys Tyr lie Phe 
1220 1225 1230 

Lys Cys Gly Asp Lys Asp Lys Lys His lie Pro Glu Glu Leu Phe Leu 
35 1235 1240 1245 

Ala Ser Glu Ser Val He Lys Ser Phe Leu Asp Gly Phe Leu Lys Ala 
1250 1255 1260 

40 

Lys Lys Asn Ser His Lys Gly Thr Ser Thr Phe Met Ala Lys Asp Glu 
1265 1270 1275 1280 

« Lys Tyr Leu Asn Gin Leu Met He Leu Phe Asn Leu Val Gly He Pro 
1285 1290 1295 

Thr Arg Phe Thr Pro Val Lys Asn Lys Gly Tyr Lys Leu Thr Leu Asn 
^ 1300 1305 1310 

Pro Lys Tyr Gly Thr Val Lys Asp Leu Met Leu Asp Glu Val Lys Glu 
55 1315 1320 1325 

lie Glu Ala Phe Glu Tyr Ser Gly Tyr Val Tyr Asp Leu Ser Val Glu 
1330 1335 1340 

60 

Asp Asn Glu Asn Phe Leu Val Asn Asn lie Tyr Ala His Asn Ser Val 
1345 1350 1355 1360 
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Tyr Gly Tyr Leu Ala Phe Pro Arg Ala Arg Phe Tyr Ser Arg Glu Cys 
1365 1370 1375 

Ala Glu lie Val Thr Tyr Leu Gly Arg Lys Tyr lie Leu Glu Thr Val 
1380 1385 1390 

Lys Glu Ala Glu Lys Phe. Gly Phe Lys Val Leu Tyr lie Asp Thr Asp 
1395 1400 1405 

Gly Phe Tyr Ala lie Trp Lys Glu Lys lie Ser Lys Glu Glu Leu lie 
1410 1415 1420 

Lys Lys Ala Met Glu Phe Val Glu Tyr lie Asn Ser Lys Leu Pro Gly 
1425 1430 1435 1440 

Thr Met Glu Leu Glu Phe Glu Gly Tyr Phe Lys Arg Gly lie Phe Val 
1445 1450 1455 

Thr Lys Lys Arg Tyr Ala Leu lie Asp Glu Asn Gly Arg Val thr Val 
1460 1465 1470 

Lys Gly Leu Glu Phe Val Arg Arg Asp Trp Ser Asn lie Ala Lys tie 
1475 1480 1485 

Thr Gin Arg Arg Val Leu Glu Ala Leu Leu Val Glu Gly Ser lie Glu 
1490 1495 1500 

Lys Ala Lys Lys He lie Gin Asp Val lie Lys Asp Leu Arg Glu Lys 
1505 1510 1515 1520 

Lys lie Lys Lys Glu Asp Leu lie lie Tyr Thr Gin Leu Thr Lys Asp 
1525 1530 1535 

Pro Lys Glu Tyr Lys Thr Thr Ala Pro His Val Glu lie Ala Lys Lys 
1540 1545 1550 

Leu Met Arg Glu Gly Lys Arg lie Lys Val Gly Asp lie lie Gly Tyr 
1555 1560 1565 

lie lie Val Lys Gly Thr Lys Ser lie Ser Glu Arg Ala Lys Leu Pro 
1570 1575 1580 

Glu Glu Val Asp lie Asp Asp lie Asp Val Asn Tyr Tyr lie Asp Asn 
1585 1590 1595 1600 

Gin lie Leu Pro Pro Val Leu Arg tie Met Glu Ala Val Gly Val Ser 
.1605 1610 1615 



8] 
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DE 198 40 771 Al 



Lys Asn Glu Leu Lys Lys Glu Gly Ala Gin Leu Thr Leu Asp Lys Phe 
1620 1625 1630 



Phe Lys 



(2) ANGABEN ZU SEQ ID NO: 25: 

li) SEQUENZKENNZEICHEN: 

(A) LANGE: 1235 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(li) ART DES MOLEKOlS; Protein 

(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Pyrococcus horlkoshii 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 25: 

Met lie Leu Asp Ala Asp Tyr lie Thr Glu Asp Gly Lys Pro lie lie 
1 5 10 15 

Arg lie Phe Lys Lys Glu Asn Gly Glu Phe Lys Val Glu Tyr Asp Arg 
20 25 30 

Asn Phe Arg Pro Tyr lie Tyr Ala Leu Leu Arg Asp Asp Ser Ala lie 
35 40 45 

Asp Glu lie Lys Lys lie Thr Ala Gin Arg His Gly Lys Val Val Arg 
50 55 60 

lie Val Glu Thr Glu Lys lie Gin Arg Lys Phe Leu Gly Arg Pro lie 
65 70 75 80 

Glu Val Trp Lys Leu Tyr Leu Glu His Pro Gin Asp Val Pro Ala He 
85 90 95 

Arg Asp Lys lie Arg Glu His Pro Ala Val Val Asp lie Phe Glu Tyr 
100 105 110 

Asp lie Pro Phe Ala Lys Arg Tyr Leu lie Asp Lys Gly Leu Thr Pro 
115 120 125 
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Met Glu Gly Asn Glu Lys Leu Thr Phe Leu Ala Val Asp lie Glu Thr 
130 135 140 

Leu Tyr His Glu Gly Glu Glu Phe Gly Lys Gly Pro Val lie Met lie 
145 150 155 160 

Ser Tyr Ala Asp Glu Glu Gly Ala Lys Val lie Thr Trp Lys Lys lie 

165 170 175 " 

Asp Leu Pro Tyr Val Glu Val Val Ser Ser Glu Arg Glu Met lie Lys 
180 185 190 

Arg Leu He Arg Val lie Lys Glu Lys Asp Pro Asp Val lie lie Thr 
195 200 205 

Tyr Asn Gly Asp Asn Phe Asp Phe Pro Tyr Leu Leu Lys Arg Ala Glu 
210 215 220 

Lys Leu Gly lie Lys Leu Leu Leu Gly Arg Asp Asn Ser Glu Pro Lys ^ 
225 230 235 240 

Met Gin Lys Met Gly Asp Ser Leu Ala Val Glu lie Lys Gly Arg lie 

245 250 255 ^ 

His Phe Asp Leu Phe Pro Val lie Arg Arg Thr lie Asn Leu Pro Thr 
260 265 270 

35 

Tyr Thr Leu Glu Ala Val Tyr Glu Ala lie Phe Gly Lys Pro Lys Glu 
275 280 285 

40 

Lys Val Tyr Ala Asp Glu lie Ala Lys Ala Trp Glu Thr Gly Glu Gly 
290 295 300 

Leu Glu Arg Val Ala Lys Tyr Ser Met Glu Asp Ala Lys Val Thr Tyr 45 
305 310 315 320 

Glu Leu Gly Arg Glu Phe Phe Pro Met Glu Ala Gin Leu Ala Arg Leu 

325 330 335 so 

Val Gly Gin Pro Val Trp Asp Val Ser Arg Ser Ser Thr Gly Asn Leu 
340 345 350 

55 

Val Glu Trp Phe Leu Leu Arg Lys Ala Tyr Glu Arg Asn Glu Leu Ala 
355 360 365 

Pro Asn Lys Pro Asp Glu Lys Glu Tyr Glu Arg Arg Leu Arg Glu Ser ^ 
370 375 380 



65 
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Tyr Glu Gly Gly Tyr Val Lys Glu Pro Glu Lys Gly Leu Trp Glu Gly 
385 390 395 400 

lie Val Ser Leu Asp Phe Arg Ser Leu Tyr Pro Ser lie lie lie Thr 
405 410 415 



10 



His Asn Val Ser Pro Asp Thr Leu Asn Arg Glu Gly Cys Glu Glu Tyr 
420 425 430 



15 



Asp Val Ala Pro Lys Val Gly His Arg Phe Cys Lys Asp Phe Pro Gly 
435 440 445 



20 



Phe lie Pro Ser Leu Leu Gly Gin Leu Leu Glu Glu Arg Gin Lys lie 
450 455 460 

Lys Lys Arg Met Lys Glu Ser Lys Asp Pro Val Glu Lys Lys Leu Leu 
465 470 475 480 



25 



Asp Tyr Arg Gin Arg Ala lie Lys lie Leu Ala Asn Ser lie Leii Pro 
485 490 495 



30 



Asp Glu Trp Leu Pro lie Val Glu Asn Glu Lys Val Arg Phe Val Lys 
500 505 510 



35 



lie Gly Asp Phe lie Asp Arg Glu He Glu Glu Asn Ala Glu Arg Val 
515 520 525 



Lys Arg Asp Gly Glu Thr Glu lie Leu Glu Val Lys Asp Leu Lys Ala 
530 535 540 

40 

Leu Ser Phe Asn Arg Glu Thr Lys Lys Ser Glu Leu Lys Lys Val Lys 
545 550 555 560 

45 Ala Leu lie Arg His Arg Tyr Ser Gly Lys Val Tyr Ser lie Lys Leu 
565 570 575 



50 



Lys Ser Gly Arg Arg lie Lys lie Thr Ser Gly His Ser Leu Phe Ser 
580 585 590 



55 



60 



Val Lys Asn Gly Lys Leu Val Lys Val Arg Gly Asp Glu Leu Lys Pro 
595 600 605 

Gly Asp Leu Val Val Val Pro Gly Arg Leu Lys Leu Pro Glu Ser Lys 
610 615 620 

Gin Val Leu Asn Leu Val Glu Leu Leu Leu Lys Leu Pro Glu Glu Glu 
625 630 635 640 



65 
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Thr Ser Asn lie Val Met Met lie Pro Val Lys Gly Arg Lys Asn Phe 
645 650 655 

Phe Lys Gly Met Leu Lys Thr Leu Tyr Trp He Phe Gly Glu Gly Glu ' 
660 665 670 

Arg Pro Arg Thr Ala Gly Arg Tyr Leu Lys His Leu Glu Arg Leu Gly 
675 680 685 

Tyr Val Lys Leu Lys Arg Arg Gly Cys Glu Val Leu Asp Trp Glu Ser 

690 695 700 u 

Leu Lys Arg Tyr Arg Lys Leu Tyr Glu Thr Leu lie Lys Asn Leu Lys 
705 710 715 720 

20 

Tyr Asn Gly Asn Ser Arg Ala Tyr Met Val Glu Phe Asn Ser Leu Arg 
725 730 735 

Asp Val Val Ser Leu Met Pro lie Glu Glu Leu Lys Glu Trp ll« He ^ 
740 745 750 

Gly Glu Pro Arg Gly Pro Lys lie Gly Thr Phe lie Asp Val Asp Asp 
755 760 765 

Ser Phe Ala Lys Leu Leu Gly Tyr Tyr lie Ser Ser Gly Asp Val Glu 

770 775 780 33 

Lys Asp Arg Val Lys Phe His Ser Lys Asp Gin Asn Val Leu Glu Asp 
785 790 795 800 

40 

lie Ala Lys Leu Ala Glu Lys Leu Phe Gly Lys Val Arg Arg Gly Arg 
805 810 815 

Gly Tyr lie Glu Val Ser Gly Lys lie Ser His Ala lie Phe Arg Val 
820 825 830 

Leu Ala Glu Gly Lys Arg lie Pro Glu Phe lie Phe Thr Ser Pro Met 
835 840 845 

Asp lie Lys Val Ala Phe Leu Lys Gly Leu Asn Gly Asn Ala Glu Glu 
850 855 860. 

.. - . ■ 55 

Leu Thr Phe Ser Thr Lys Ser Glu Leu Leu Val Asn Gin Leu lie Leu 
865 870 875 880 

Leu Leu Asn Ser lie Gly Val Ser Asp lie Lys lie Glu His Glu Lys ^ 
885 890 895 
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Gly Val Tyr Arg Val Tyr lie Asn Lys Lys Glu Ser Ser Asn Gly Asp 
900 905 910 

* lie Val Leu Asp Ser Val Glu Ser lie Glu Val Glu Lys Tyr Glu Gly 
915 920 925 

,1, Tyr Val Tyr Asp Leu Ser Val Glu Asp Asn Glu Asn Phe Leu Val Gly 
930 935 940 

Phe Gly Leu Leu Tyr Ala His Asn Ser Tyr Tyr Gly Tyr Tyr Gly Tyr 
IS 945 950 955 960 

Ala Lys Ala Arg Trp Tyr Cys Lys Glu Cys Ala Glu Ser Val Thr Ala 
965 970 975 

20 

Trp Gly Arg Gin Tyr lie Asp Leu Val Arg Arg Glu Leu Glu Ala Arg 
980 985 990 



25 



30 



40 



45 



Gly Phe Lys Val Leu Tyr lie Asp Thr Asp Gly Leu Tyr Ala Thr lie 
995 1000 1005 

Pro Gly Val Lys Asp Trp Glu Glu Val Lys Arg Arg Ala Leu Glu Phe 
1010 1015 1020 

Val Asp Tyr He Asn Ser Lys Leu Pro Gly Val Leu Glu Leu Glu Tyr 
1025 1030 1035 1040 

Glu Gly Phe Tyr Ala Arg Gly Phe Phe Val Thr Lys Lys Lys Tyr Ala 
1045 1050 1055 

Leu 1le Asp Glu Glu Gly Lys lie Val Thr Arg Gly Leu Glu He Val 
1060 1065 1070 

Arg Arg Asp Trp Ser Glu lie Ala Lys Glu Thr Gin Ala Arg Val Leu 
1075 1080 1085 



Glu Ala lie Leu Lys His Gly Asn Val Glu Glu Ala Val Lys lie Val 
^ 1090 1095 1100 

Lys Asp Val Thr Glu Lys Leu Thr Asn Tyr Glu Val Pro Pro Glu Lys 
1105 1110 1115 1120 



55 



60 



65 



Leu Val lie Tyr Glu Gin lie Thr Arg Pro lie Asn Glu Tyr Lys Ala 
1125 1130 1135 

He Gly Pro His Val Ala Val Ala Lys Arg Leu Met Ala Arg Gly lie 
1140 1145 1150 
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Lys Val Lys Pro Gly Met Val lie Gly Tyr He Val Leu Arg Gly Asp 
1155 1160 1165 

Gly Pro lie Ser Lys Arg Ala lie Ser lie Glu Glu Phe Asp Pro Arg 
1170 1175 1180 

Lys His Lys Tyr Asp Ala Glu Tyr Tyr lie Glu Asn Gin Val Leu Pro 
1185 1190 1195 1200 

Ala Val Glu Arg Me Leu Lys Ala Phe Gly Tyr Lys Arg Glu Asp Leu 
1205 1210 1215 

Arg Trp Gin Lys Thr Lys Gin Val Gly Leu Gly Ala Trp lie Lys Val 
1220 1225 1230 

Lys Lys Ser 
1235 

(2) ANGABEN ZU SEQ ID NO: 26: n 

10 SEQUENZKENN2EICHEN: 

(A) LANGE: 586 Aminosauren 
(8) ART: Aminosaure 
(C) STRANGFORM: EInzelstrang 
ID) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 

(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Methanobacterium thermoautotrophicum 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 26: 

Met Glu Asp Tyr Arg Met Val Leu Leu Asp He Asp Tyr Val Thr Val 
15 10 15 

Asp Glu Val Pro Val lie Arg Leu Phe Gly Lys Asp Lys Ser Gly Gly 
20 25 30 

Asn Glu Pro lie lie Ala His Asp Arg Ser Phe Arg Pro Tyr He Tyr 
35 40 45 

Ala He Pro Thr Asp Leu Asp Glu Cys Leu Arg Glu Leu Glu Glu Leu 
50 55 60 
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Glu Leu Glu Lys Leu Glu Val Lys Glu Met Arg Asp Leu Gly Arg Pro 
65 70 75 80 

Thr Glu Val lie Arg lie Glu Phe Arg His Pro Gin Asp Val Pro Lys 
85 90 95 



10 



lie Arg Asp Arg lie Arg Asp Leu Glu Ser Val Arg Asp lie Arg Glu 
100 105 110 



His Asp lie Pro Phe Tyr Arg Arg Tyr Leu lie Asp Lys Ser lie Val 
15 115 120 125 

Pro Met Glu Glu Leu Glu Phe Gin Gly Val Glu Val Asp Ser Ala Pro 
130 135 140 



20 



Ser Val Thr Thr Asp Val Arg Thr Val Glu Val Thr Gly Arg Val Gin 
145 150 155 160 



25 



Ser Thr Gly Ser Gly Ala His Gly Leu Asp lie Leu Ser Phe A$p lie 
165 170 175 



^ Glu Val Arg Asn Pro His Gly Met Pro Asp Pro Glu Lys Asp Glu lie 
180 185 190 

Val Met lie Gly Val Ala Gly Asn Met Gly Tyr Glu Ser Val lie Ser 
35 195 200 205 

Thr Ala Gly Asp His Leu Asp Phe Val Glu Val Val Glu Asp Glu Arg 
210 215 220 

40 

Glu Leu Leu Glu Arg Phe Ala Glu lie Val He Asp Lys Lys Pro Asp 
225 230 235 240 

45 lie Leu Val Gly Tyr Asn Ser Asp Asn Phe Asp Phe Pro Tyr lie Thr 
245 250 255 

Arg Arg Ala Ala lie Leu Gly Ala Glu Leu Asp Leu Gly Trp Asp Gly 
260 265 270 



55 



60 



Ser Lys lie Arg Thr Met Arg Arg Gly Phe Ala Asn Ala Thr Ala lie 
275 280 285 

Lys Gly Thr Val His Val Asp Leu Tyr Pro Val Met Arg Arg Tyr Met 
290 295 300 

Asn Leu Asp Arg Tyr Thr Leu Glu Arg Val Tyr Gin Glu Leu Phe Gly 
305 310 315 320 



65 
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Glu Glu Lys lie Asp Leu Pro Gly Asp Arg Leu Trp Glu Tyr Trp Asp 
325 330 335 

Arg Asp Glu Leu Arg Asp Glu Leu Phe Arg Tyr Ser Leu Asp Asp Val * 
340 345 350 

Val Ala Thr His Arg lie Ala Glu Lys lie Leu Pro Leu Asn Leu Glu 
355 360 365 

Leu Thr Arg Leu Val Gly Gin Pro Leu Phe Asp He Ser Arg Met Ala 

370 375 380 u 

Thr Gly Gin Gin Ala Glu Trp Phe Leu Val Arg Lys Ala Tyr Gin Tyr 
385 390 395 400 

20 

Gly Glu Leu Val Pro Asn Lys Pro Ser Gin Ser Asp Phe Ser Ser Arg 
405 410 415 

Arg Gly Arg Arg Ala Val Gly Gly Tyr Val Lys Glu Pro Glu Lys Gly ^ 
420 425 430 

Leu His Glu Asn lie Val Gin Phe Asp Phe Arg Ser Leu Tyr Pro Ser 
435 440 445 

lie lie lie Ser Lys Asn He Ser Pro Asp Thr Leu Thr Asp Asp Glu 
450 455 460 

35 

Glu Ser Glu Cys Tyr Val Ala Pro Glu Tyr Gly Tyr Arg Phe Arg Lys 
465 470 475 480 

40 

Ser Pro Arg Gly Phe Val Pro Ser Val lie Gly Glu lie Leu Ser Glu 
485 490 495 

Arg Val Arg lie Lys Glu Glu Met Lys Gly Ser Asp Asp Pro Met Glu 45 
500 505 510 

Arg Lys lie Leu Asn Val Gin Gin Glu Ala Leu Lys Arg Leu Ala Asn 

515 520 525 so 

Thr Met Tyr Gly Val Tyr Gly Tyr Ser Arg Phe Arg Trp Tyr Ser Met 
530 535 540 

... : ■ 55 

Glu Cys Ala Glu Ala He Thr Ala Trp Gly Arg Asp Tyr He Lys Lys 
545 550 555 560 

Thr He Lys Thr Ala Glu Glu Phe Gly Phe His Thr Val Tyr Ala Asp ^ 
565 570 575 
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Thr Asp Gly Phe Tyr Ala Jhr Tyr Arg Gly 
580 585 

(2) ANGABEN ZU SEQ ID NO: 27: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 1 143 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 

(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Archaeoglobus fulgidus 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 27: 

Met Asp Ala Thr Leu Asp Arg Phe Phe Pro Leu Phe Glu Ser Glu Ser 
1 5 10 15 

Asn Glu Asp Phe Trp Arg lie Glu Glu lie Arg Arg Tyr His Glu Ser 
20 25 30 

Leu Met Val Glu Leu Asp Arg He Tyr Arg lie Ala Glu Ala Ala Arg 
35 40 45 

Lys Lys Gly Leu Asp Pro Glu Leu Ser Val Glu lie Pro lie Ala Lys 
50 55 60 

Asn Met Ala Glu Arg Val Glu Lys Leu Met Asn Leu Gin Gly Leu Ala 
65 70 75 80 

Lys Arg lie Met Glu Leu Glu Glu Gly Gly Leu Ser Arg Glu Leu lie 
85 90 95 

Cys Phe Lys Val Ala Asp Glu He Val Glu Gly Lys Phe Gly Glu Met 
100 105 110 

Pro Lys Glu Glu Ala lie Asp Lys Ala Val Arg Thr Ala Val Ala lie 
115 120 125 

Met Thr Glu Gly Val Val Ala Ala Pro lie Glu Gly lie Ala Arg Val 
130 135 140 
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Arg lie Asp Arg Glu Asn Phe Leu Arg Val Tyr Tyr Ala Gly Pro lie 
145 150 155 160 

Arg Ser Ala Gly Gly Thr Ala Gin Val lie Ser Val Leu Val Ala Asp 
165 170 175 

Tyr Val Arg Arg Lys Ala Glu lie Gly Arg Tyr Val Pro Thr Glu Glu 
180 185 190 

Glu lie Leu Arg Tyr Cys Glu Glu lie Pro Leu Tyr Lys Lys Val Ala 
195 200 205 

Asn Leu Gin Tyr Leu Pro Ser Asp Glu Glu He Arg Leu lie Val Ser 
210 215 220 

Asn Cys Pro He Cys lie Asp Gly Glu Pro Thr Glu Ser Ala Glu Val 
225 230 235 240 

Ser Gly Tyr Arg Asn Leu Pro Arg Val Glu Thr Asn Arg Val^Arg Gly 
245 250 255 

Gly Met Ala Leu Val lie Ala Glu Gly He Ala Leu Lys Ala Pro Lys 
260 265 270 

Leu Lys Lys Met Val Asp Glu Val Gly lie Glu Gly Trp Glu Trp Leu 
275 280 285 

Asp Ala Leu lie Lys Gly Gly Gly Asp Ser Gly Ser Glu Glu Glu Lys 
290 295 300 

Ala Val He Lys Pro Lys Asp Lys Tyr Leu Ser Asp He Val Ala Gly 
305 310 315 320 

Arg Pro Val Leu Ser His Pro Ser Arg Lys Gly Gly Phe Arg Leu Arg 
325 330 335 

Tyr Gly Arg Ala Arg Asn Ser Gly Phe Ala Thr Val Gly Val Asn Pro 
340 345 350 

Ala Thr Met Tyr Leu Leu Glu Phe Val Ala Val Gly Thr Gin Leu Lys 
355 360 365 

Val Glu Arg Pro Gly Lys Ala Gly Gly Val Val Pro Val Ser Thr He 
370 375 380 

Glu Gly Pro Thr Val Arg Leu Lys Asn Gly Asp Val Val Lys He Asn 
385 390 395 400 
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Thr Leu Ser Glu Ala Lys Ala Leu Lys Gly Glu Val Ala Ala lie Leu 
405 410 415 

Asp Leu Gly Glu He Leu*lle Asn Tyr Gly Asp Phe Leu Glu Asn Asn 
420 425 430 



10 His Pro Leu lie Pro Ala Ser Tyr Thr Tyr Glu Trp Trp lie Gin Glu 
435 440 445 

Ala Glu Lys Ala Gly Leu Arg Gly Asp Tyr Arg Lys lie Ser Glu Glu 
15 450 455 460 



20 



25 



Glu Ala Leu Lys Leu Cys Asp Glu Phe His Vat Pro Leu His Pro Asp 
465 470 475 480 

Tyr Thr Tyr Leu Trp His Asp lie Ser Val Glu Asp Tyr Arg Tyr Leu 
485 490 495 

Arg Asn Phe Val Ser Asp Asn Gly Lys lie Glu Gly Lys His Gly Lys 
500 505 510 V 



30 



Ser Val Leu Leu Leu Pro Tyr Asp Ser Arg Val Lys Glu lie Leu Glu 
515 520 525 



35 



Ala Leu Leu Leu Glu His Lys Val Arg Glu Ser Phe He Val lie Glu 
530 535 540 



Thr Trp Arg Ala Phe lie Arg Cys Leu Gly Leu Asp Glu Lys Leu Ser 
545 550 555 560 

40 

Lys Val Ser Glu Val Ser Gly Lys Asp Val Leu Glu lie Val Asn Gly 
565 570 575 

45 lie Ser Gly lie Lys Val Arg Pro Lys Ala Leu Ser Arg lie Gly Ala 
580 585 590 



50 



Arg Met Gly Arg Pro Glu Lys Ala Lys Glu Arg Lys Met Ser Pro Pro 
595 600 605 



55 



60 



Pro His lie Leu Phe Pro Val Gly Met Ala Gly Gly Asn Thr Arg Asp 
610 615 620 

He Lys Asn Ala lie Asn Tyr Thr Lys Ser Tyr Asn Ala Lys Lys Gly 
625 630 635 640 

Glu lie Glu Val Glu lie Ala lie Arg Lys Cys Pro Gin Cys Gly Lys 
645 650 655 



65 
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Glu Thr Phe Trp Leu Lys Cys Asp Val Cys Gly Glu Leu Thr Glu Gin 
660 665 670 

Leu Tyr Tyr Cys Pro Ser Cys Arg Met Lys Asn Thr Ser Ser Val Cys 
675 680 685 

Glu Ser Cys Gly Arg Glu Cys Glu Gly Tyr Met Lys Arg Lys Val Asp 
690 695 700 

Leu Arg Glu Leu Tyr Glu Glu Ala lie Ala Asn Leu Gly Glu Tyr Asp 
705 710 715 720 

Ser Phe Asp thr lie Lys Gly Val Lys Gly Met Thr Ser Lys Thr Lys 
725 730 735 

lie Pro Glu Arg Leu Glu Lys Gly lie Leu Arg Val Lys His Gly Val 
740 745 750 

Phe Val Phe Lys Asp Gly Thr Ala Arg Phe Asp Ala Thr As^' Leu Pro 
755 760 765 

lie Thr His Phe Lys Pro Ala Glu lie Gly Val Ser Val Glu Lys Leu 
770 775 780 

Arg Glu Leu Gly Tyr Glu Arg Asp Tyr Lys Gly Ala Glu Leu Lys Asn 
785 790 795 800 

Glu Asn Gin lie Val Glu Leu Lys Pro Gin Asp Val He Leu Pro Lys 
805 810 815 

Ser Gly Ala Glu Tyr Leu Leu Arg Val Ala Asn Phe lie Asp Asp Leu 
820 825 830 

Leu Val Lys Phe Tyr Lys Met Glu Pro Phe Tyr Asn Ala Lys Ser Val 
835 840 845 

Glu Asp Leu lie Gly His Leu Val lie Gly Leu Ala Pro His Thr Ser 
850 855 860 

Ala Gly Val Leu Gly Arg lie lie Gly Phe Ser Asp Val Leu Ala Gly 
865 870 875 880 

Tyr Ala His Pro Tyr Phe His Ala Ala Lys Arg Arg Asn Cys Asp Gly 
885 890 895 

Asp Glu Asp Cys Phe Met Leu Leu Leu Asp Gly Leu Leu Asn Phe Ser 
900 905 910 
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Arg Lys Phe Leu Pro Asp Lys Arg Gly Gly Gin Met Asp Ala Pro Leu 
915 920 925 

Val Leu Thr Ala lie Val Asp Pro Arg Giu Val Asp Lys Glu Val His 
930 935 940 

Asn Met Asp lie Val Glu Arg Tyr Pro Leu Glu Phe Tyr Glu Ala Thr 
945 950 955 960 

Met Arg Phe Ala Ser Pro Lys Glu Met Glu Asp Tyr Val Glu Lys Val 
965 970 975 

Lys Asp Arg Leu Lys Asp Glu Ser Arg Phe Cys Gly Leu Phe Phe Thr 
980 985 990 

His Asp Thr Glu Asn lie Ala Ala Gly Val Lys Glu Ser Ala Tyr Lys 
995 1000 1005 

Ser Leu Lys Thr Met Gin Asp Lys Val Tyr Arg Gin Met'Glu Leii Ala 
1010 1015 1020 

Arg Met lie Val Ala Val Asp Glu His Asp Val Ala Glu Arg Val lie 
1025 1030 1035 1040 

Asn Val His Phe Leu Pro Asp lie lie Gly Asn Leu Arg Ala Phe Ser 
1045 1050 1055 

Arg Gin Glu Phe Arg Cys Thr Arg Cys Asn Thr Lys Tyr Arg Arg lie 
1060 1065 1070 

Pro Leu Val Gly Lys Cys Leu Lys Cys Gly Asn Lys Leu Thr Leu Thr 
1075 1080 1085 

Val His Ser Ser Ser lie Met Lys Tyr Leu Glu Leu Ser Lys Phe Leu 
1090 1095 1100 

Cys Glu Asn Phe Asn Val Ser Ser Tyr Thr Lys Gin Arg Leu Met Leu 
1105 1110 1115 1120 

Leu Glu Gin Glu lie Lys Ser Met Phe Glu Asn Gly Thr Glu Lys Gin 
1125 1130 1135 

Val Ser lie Ser Asp Phe Val 
1140 

(2) ANGABEN ZU SEQ ID NO: 28: 
(i) SEQUENZKENNZEICHEN: 
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(A) LANGE: 1 139 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 

(vi) URSPRUNLICHE HERKUNFT: 

(A) ORGANISMUS: Methanococcus jannaschii 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 28: 

Met He Val Met Val His Val Ala Cys Ser Glu Asn Met Lys Lys Tyr 
1 5 10 15 

Phe Glu Asn lie Val Asp Glu Val Lys Lys lie Tyr Arg He Ala- Glu 
20 25 30 

Glu Cys Arg Lys Lys Gly Phe Asp Pro Thr Asp Glu Val Glu lie Pro 
35 40 45 

Leu Ala Ala Asp Met Ala Asp Arg Val Glu Gly Leu Val Gly Pro Lys 
50 55 60 

Gly Val Ala Glu Arg lie Arg Glu Leu Val Lys Glu Leu Gly Lys Glu 
65 70 75 80 

Pro Ala Ala Leu Glu lie Ala Lys Glu lie Val Glu Gly Lys Phe Gly 
85 90 95 

Asn Phe Asp Lys Glu Lys Lys Ala Glu Gin Ala Val Arg Thr Ala Leu 
100 105 110 

Ala Val Leu Thr Glu Gly lie Val Ala Ala Pro Leu Glu Gly lie Ala 
115 1 20 1 25 

Asp Val Lys lie Lys Lys Asn Pro Asp Gly Thr Glu Tyr Leu Ala lie 
130 135 140 

Tyr Tyr Ala Gly Pro lie Arg Ser Ala Gly Gly Thr Ala Gin Ala Leu 
145 150 155 160 

Ser Val Leu Val Gly Asp Phe Val Arg Lys Ala Met Gly Leu Asp Arg 
165 170 175 
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Tyr Lys Pro Thr Glu Asp Glu lie Glu Arg Tyr Val Glu Glu Val Glu 
180 185 190 

Leu Tyr Gin Ser Glu Val Gly Ser Phe Gin Tyr Asn Pro Thr Ala Asp 
1 95 200 205 



lU 



Glu lie Arg Thr Ala lie Arg Asn lie Pro He Glu lie Thr Gly Glu 
210 215 220 



Ala Thr Asp Asp Val Glu Val Ser Gly His Arg Asp Leu Pro Arg Val 
IS 225 230 235 240 

Glu Thr Asn Gin Leu Arg Gly Gly Ala Leu Leu Val Leu Val Glu Gly 
245 . 250 255 



20 



Val Leu Leu Lys Ala Pro Lys lie Leu Arg His Val Asp Lys Leu Gly 

260 265 270 



25 



lie Glu Gly Trp Asp Trp Leu Lys Asp Leu Met Ser Lys Lys ^lu Glu 
275 280 285 V 



30 



Lys Glu Glu Glu Lys Asp Glu Lys Val Asp Asp Glu Glu lie Asp Glu 
290 295 300 



3S 



Glu Glu Glu Glu lie Ser Gly Tyr Trp Arg Asp Val Lys lie Glu Ala 
305 310 315 320 



Asn Lys Lys Phe lie Ser Glu Val lie Ala Gly Arg Pro Val Phe Ala 
325 330 335 

40 

His Pro Ser Lys Val Gly Gly Phe Arg Leu Arg Tyr Gly Arg Ser Arg 
340 345 350 

45 Asn Thr Gly Phe Ala Thr Gin Gly Phe His Pro Ala Leu Met Tyr Leu 
355 360 365 



50 



Val Asp Glu Phe Met Ala Val Gly Thr Gin Leu Lys Thr Glu Arg Pro 
370 375 380 



55 



60 



Gly Lys Ala Thr Cys Val Val Pro Val Asp Ser lie Glu Pro Pro lie 
385 390 395 400 

Val Lys Leu Lys Asn Gly Asp Val lie Arg Val Asp Thr He Glu Lys 
405 410 415 

Ala Met Asp Val Arg Asn Arg Val Glu Glu lie Leu Phe Leu Gly Asp 
420 425 430 



65 
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Val Leu Val Asn Tyr Gly Asp Phe Leu Glu Asn Asn His Pro Leu Leu 
435 440 445 

Pro Ser Cys Trp Cys Glu Glu Trp Tyr Glu Lys lie Leu He Ala Asn 
450 455 460 

Asn lie Glu Tyr Asp Lys Asp Phe lie Lys Asn Pro Lys Pro Glu Glu 
465 470 475 480 

Ala Val Lys Phe Ala Leu Glu Thr Lys Thr Pro Leu His Pro Arg Phe 
485 490 495 

Thr Tyr His Trp His Asp Val Ser Lys Glu Asp lie He Leu Leu Arg 
500 505 510 

Asn Trp Leu Leu Lys Gly Lys Glu Asp Ser Leu Glu Gly Lys Lys Val 
515 520 525 

Trp He Val Asp Leu Glu He Glu Glu Asp Lys Lys Ala Lys Ai^ He 
530 535 540 

Leu Glu Leu He Gly Cys Cys His Leu Val Arg Asn Lys Lys Val He 
545 550 555 560 

lie Glu Glu Tyr Tyr Pro Leu Leu Tyr Ser Leu Gly Phe Asp Val Glu 
565 570 575 

Asn Lys Lys Asp Leu Val Glu Asn lie Glu Lys lie Leu Glu Ser Ala 
580 585 590 

Lys Asn Ser Met His Leu He Asn Leu Leu Ala Pro Phe Glu Val Arg 
595 600 605 

Arg Asn Thr Tyr Val Tyr Val Gly Ala Arg Met Gly Arg Pro Glu Lys 
610 615 620 

Ala Ala Pro Arg Lys Met Lys Pro Pro Val Asn Gly Leu Phe Pro He 
625 630 635 640 

Gly Asn Ala Gly Gly Gin Val Arg Leu He Asn Lys Ala Val Glu Glu 
645 650 655 

Asn Asn Thr Asp Asp Val Asp Val Ser Tyr Thr Arg Cys Pro Asn Cys 
660 665 670 

Gly Lys He Ser Leu Tyr Arg Val Cys Pro Phe Cys Gly Thr Lys Val 
675 680 685 
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Glu Leu Asp Asn Phe Gly Arg lie Lys Ala Pro Leu Lys Asp Tyr Trp 
690 695 700 

* Tyr Ala Ala Leu Lys Arg Leu Gly lie Asn Lys Pro Gly Asp Val Lys 
705 710 715 720 

,u Cys lie Lys Gly Met Thr Ser Lys Gin Lys lie Val Glu Pro Leu Glu 
725 730 735 

Lys Ala lie Leu Arg Ala lie Asn Glu Val Tyr Val Phe Lys Asp Gly 
IS 740 745 750 

Thr Thr Arg Phe Asp Cys Thr Asp Val Pro Val Thr His Phe Lys Pro 
755 760 765 

20 

Asn Glu lie Asn Val Thr Val Glu Lys Leu Arg Glu Leu Gly Tyr Asp 
770 775 780 

^ Lys Asp lie Tyr Gly Asn Glu Leu Val Asp Gly Glu Gin Val yal Glu 
785 790 795 800 

^ Leu Lys Pro Gin Asp Val He lie Pro Glu Ser Cys Ala Glu Tyr Phe 
805 810 815 

Val Lys Val Ala Asn Phe He Asp Asp Leu Leu Glu Lys Phe Tyr Lys 
35 820 825 830 

Val Glu Arg Phe Tyr Asn Val Lys Lys Lys Glu Asp Leu lie Gly His 
835 840 845 

40 

Leu Val lie Gly Met Ala Pro His Thr Ser Ala Gly Met Val Gly Arg 
850 855 860 

45 lie lie Gly Tyr Thr Lys Ala Asn Val Gly Tyr Ala His Pro Tyr Phe 
865 870 875 880 

His Ala Ala Lys Arg Arg Asn Cys Asp Gly Asp Glu Asp Ser Phe Phe 
885 890 895 

Leu Leu Leu Asp Ala Phe Leu Asn Phe Ser Lys Lys Phe Leu Pro Asp 
35 900 905 910 

Lys Arg Gly Gly Gin Met Asp Ala Pro Leu Val Leu Thr Thr He Leu 
915 920 925 

an 

Asp Pro Lys Glu Val Asp Gly Glu Val His Asn Met Asp Thr Met Trp 
930 935 940 
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Ser Tyr Pro Leu Glu Phe Tyr Glu Lys Thr Leu Glu Met Pro Ser Pro 
945 950 955 960 

Lys Glu Val Lys Glu Phe Met Glu Thr Val Glu Asp Arg Leu Gly Lys 
965 970 975 

Pro Glu Gin Tyr Glu Gly lie Gly Tyr Thr His Glu Thr Ser Arg lie 
980 985 990 

Asp Leu Gly Pro Lys Val Cys Ala Tyr Lys Thr Leu Gly Ser Met Leu 
995 1000 1005 

Glu Lys Thr Thr Ser Gin Leu Ser Val Ala Lys Lys lie Arg Ala Thr 
1010 1015 1020 

Asp Glu Arg Asp Val 'Ala Glu Lys Val lie Gin Ser His Phe lie Pro 
1025 1030 1035 1040 

Asp Leu lie Gly Asn Leu Arg Ala Phe Ser Arg Gin Ala \<al Arg Cys 
1045 1050 1055 

Lys Cys Gly Ala Lys Tyr Arg Arg lie Pro Leu Lys Gly Lys Cys Pro 
1060 1065 1070 

Lys Cys Gly Ser Asn Leu lie Leu Thr Val Ser Lys Gly Ala Val Glu 
1075 1080 1085 

Lys Tyr Met Asp Val Ala Glu Lys Met Ala Glu Glu Tyr Asn Val Asn 
1090 1095 1100 

Asp Tyr lie Lys Gin Arg Leu Lys lie lie Lys Glu Gly lie Asn Ser 
1105 1110 1115 1120 

lie Phe Glu Asn Glu Lys Ser Arg Gin Val Lys Leu Ser Asp Phe Phe 
1125 1130 1135 

Lys lie Gly 



(2) ANGABEN ZU SEQ ID NO: 29: 

(i) SEQUENZKENNZEICHEN: 

|A) LANGE: 1434 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 
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(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Pyrococcus horikoshii 



(xl) SEQUENZBESCHREIBUNG: SEQ ID NO: 29: 

Met Val Leu Met Glu Leu Pro Lys Glu Met Glu Glu Tyr Phe Ser Met 
15 10 15 

Leu Gin Arg Glu lie Asp Lys Ala Tyr Glu lie Ala Lys Lys Ala Arg 
20 25 30 

Ala Gin Gly Lys Asp Pro Ser Leu Asp Val Glu lie Pro Gin Ala Ser 
35 40 45 

Asp Met Ala Gly Arg Val Glu Ser Leu Val Gly Pro Pro Gly Val Ala 
50 55 60 - 

Glu Arg He Arg Glu Leu Val Lys Glu Tyr Gly Lys Glu lie Afe Ala 
65 70 75 80 

Leu Lys lie Val Asp Glu lie lie Asp Gly Lys Phe Gly Asp Leu Gly 
85 90 95 

Ser Lys Glu Lys Tyr Ala Glu Gin Ala Val Arg Thr Ala Leu Ala lie 
100 105 110 

Leu Thr Glu Gly Val Val Ser Ala Pro lie Glu Gly lie Ala Ser Val 
115 120 125 

Lys lie Lys Arg Asn Thr Trp Ser Asp Asn Ser Glu Tyr Leu Ala Leu 
130 135 140 

Tyr Tyr Ala Gly Pro lie Arg Ser Ser Gly Gly Thr Ala Gin Ala Leu 
145 150 155 160 

Ser Val Leu Val Gly Asp Tyr Val Arg Arg Lys Leu Gly Leu Asp Arg 
165 170 175 

Phe Lys Pro Ser Glu Lys His He Glu Arg Met Val Glu Glu Val Asp 
180 185 190 

Leu Tyr His Arg Thr Val Ser Arg Leu Gin Tyr His Pro Ser Pro Glu 
1 95 200 205 

Glu Val Arg Leu Ala Met Arg Asn lie Pro lie Glu lie Thr Gly Glu 
210 215 220 
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Ala Thr Asp Glu Val Glu Val Ser His Arg Asp lie Pro Gly Val Glu 
225 230 235 240 

thr Asn Gin Leu Arg Gly Gly Ala lie Leu Val Leu Ala Glu Gly Val 
245 250 255 

Leu Gin Lys Ala Lys Lys Leu Val Lys Tyr lie Asp Lys Met Gly lie 
260 265 270 

Glu Gly Trp Glu Trp Leu Lys Glu Phe Val Glu Ala Lys Glu Lys Gly 
275 280 285 

Glu Glu Me Glu Glu Glu Gly Ser Ala Glu Ser Thr Val Glu Glu Thr 
290 295 300 

Lys Val Glu Val Asp Met Gly Phe Tyr Tyr Ser Leu Tyr Gin Lys Phe 
305 310 315 320 

Lys Ser Glu lie Ala Pro Asn Asp Lys Tyr Ala Lys Glu lie ll< Gly 
325 330 335 

Gly Arg Pro Leu Phe Ser Asp Pro Ser Arg Asn Gly Gly Phe Arg Leu 
340 345 350 

Arg Tyr Gly Arg Ser Arg Val Ser Gly Phe Ala Thr Trp Gly lie Asn 
355 360 365 

Pro Ala Thr Met lie Leu Val Asp Glu Phe Leu Ala lie Gly Thr Gin 
370 375 380 

Leu Lys Thr Glu Arg Pro Gly Lys Gly Ala Val Val Thr Pro Val Thr 
385 390 395 400 

Thr lie Glu Gly Pro lie Val Lys Leu Lys Asp Gly Ser Val Val Lys 
405 410 415 

Val Asp Asp Tyr Lys Leu Ala Leu Lys lie Arg Asp Glu Val Glu Glu 
420 425 430 

lie Leu Tyr Leu Gly Asp Ala Val lie Ala Phe Gly Asp Phe Val Glu 
435 440 445 

Asn Asn Gin Thr Leu Leu Pro Ala Asn Tyr Cys Glu Glu Trp Trp lie 
450 455 460 

Leu Glu Phe Thr Lys Ala Leu Asn Glu lie Tyr Glu Val Glu Leu Lys 
465 470 475 480 
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Pro Phe Glu Val Asn Ser Ser Glu Asp Leu Glu Glu Ala Ala Asp Tyr 
485 490 495 

Leu Glu Val Asp lie Glu Phe Leu Lys Glu Leu Leu Lys Asp Pro Leu 
500 505 510 

« 

10 Arg Thr Lys Pro Pro Val Glu Leu Ala lie His Phe Ser Glu lie Leu 
515 520 525 

Gly lie Pro Leu His Pro Tyr Tyr Thr Leu Tyr Trp Asn Ser Val Lys 
15 530 535 540 

Pro Glu Gin Val Glu Lys Leu Trp Arg Val Leu Lys Glu His Ala His 
545 550 555 560 

20 

lie Asp Trp Asp Asn Phe Arg Gly lie Lys Phe Ala Arg Arg lie Val 
565 570 575 

^ lie Pro Leu Glu Lys Leu Arg Asp Ser Lys Arg Ala Leu Glu lieu Leu 
580 585 590 

^ Gly Leu Pro His Lys Val Glu Gly Lys Asn Val lie Val Asp Tyr Pro 
595 600 605 

Trp Ala Ala Ala Leu Leu Thr Pro Leu Gly Asn Leu Glu Trp Glu Phe 
35 610 615 620 

Arg Ala Lys Pro Leu His Thr Thr lie Asp lie lie Asn Glu Asn Asn 
625 630 635 640 

40 

Glu lie Lys Leu Arg Asp Arg Gly lie Ser Trp lie Gly Ala Arg Met 
645 650 655 

45 Gly Arg Pro Glu Lys Ala Lys Glu Arg Lys Met Lys Pro Pro Val Gin 
660 665 670 

Val Leu Phe Pro lie Gly Leu Ala Gly Gly Ser Ser Arg Asp lie Lys 
^ 675 680 685 

Lys Ala Ala Glu Glu Gly Lys Val Ala Glu Val Glu lie Ala Leu Phe 

3j 690 695 700 

Lys Cys Pro Lys Cys Gly His Val Gly Pro Glu His lie Cys Pro Asn 
705 710 715 720 

60 

Cys Gly Thr Arg Lys Glu Leu lie Trp Val Cys Pro Arg Cys Asn Ala 
725 730 735 
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Glu Tyr Pro Glu Ser Gin Ala Ser Gly Tyr Asn Tyr Thr Cys Pro Lys 
740 745 750 

Cys Asn Val Lys Leu Lys Pro Tyr Ala Lys Arg Lys lie Lys Pro Ser 
755 760 765 

Glu Leu Leu Lys Arg Ala Met Asp Asn Val Lys Val Tyr Gly lie Asp 
770 775 780 

Lys Leu Lys Gly Val Met Gly Met Thr Ser Gly Trp Lys Met Pro Glu 
785 790 795 800 

Pro Leu Glu Lys Gly Leu Leu Arg Ala Lys Asn Asp Val Tyr Val Phe 
805 810 815 

Lys Asp Gly Thr lie Arg Phe Asp Ala Thr Asp Ala Pro lie Thr His 
820 825 830 

Phe Arg Pro Arg Glu lie Gly Val Ser Val Glu Lys Leu Arg G(u Leu 
835 840 845 

Gly Tyr Thr His Asp Phe Glu Gly Asn Pro Leu Val Ser Glu Asp Gin 
850 855 860 

lie Val Glu Leu Lys Pro Gin Asp lie lie Leu Ser Lys Glu Ala Gly 
865 870 875 880 

Lys Tyr Leu Leu Lys Val Ala Lys Phe Val Asp Asp Leu Leu Glu Lys 
885 890 895 

Phe Tyr Gly Leu Pro Arg Phe Tyr Asn Ala Glu Lys Met Glu Asp Leu 
900 905 910 

lie Gly His Leu Val lie Gly Leu Ala Pro His Thr Ser Ala Gly lie 
915 920 925 

Val Gly Arg lie lie Gly Phe Val Asp Ala Leu Val Gly Tyr Ala His 
930 935 940 

Pro Tyr Phe His Ala Ala Lys Arg Arg Asn Cys Phe Pro Gly Asp Thr 
945 950 955 960 

Arg lie Leu Val Gin lie Asn Gly Thr Pro Gin Arg Val Thr Leu Lys 
965 970 975 

Glu Leu Tyr Glu Leu Phe Asp Glu Glu His Tyr Glu Ser Met Val Tyr 
980 985 990 
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Val Arg Lys Lys Pro Lys Val Asp lie Lys Val Tyr Ser Phe Asn Pro 
995 1000 1005 

Glu Glu Gly Lys Val Val Leu Thr Asp lie Glu Glu Val lie Lys Ala 
1010 1015 1020 

,0 Pro Ala Thr Asp His Leu lie Arg Phe Glu Leu Glu Leu Gly Ser Ser 
1025 1030 1035 1040 

Phe Glu Thr Thr Val Asp His Pro Val Leu Val Tyr Glu Asn Gly Lys 
15 1045 1050 1055 

Phe Val Glu Lys Arg Ala Phe Glu Val Arg Glu Gly Asn lie lie lie 
1060 1065 1070 

20 

lie lie Asp Glu Ser Thr Leu Glu Pro Leu Lys Val Ala Val Lys Lys 
1075 1080 1085 

^ Me Glu Phe lie Glu Pro Pro Glu Asp Phe Val Phe Ser Leu Asn Ala 
1090 1095 1100 

3P Lys Lys Tyr His Thr Val lie lie Asn Glu Asn lie Val Thr His Gin 
1105 1110 1115 1120 

Cys Asp Gly Asp Glu Asp Ala Val Met Leu Leu Leu Asp Ala Leu Leu 
35 1125 1130 1135 

Asn Phe Ser Arg Tyr Tyr Leu Pro Glu Lys Arg Gly Gly Lys Met Asp 
1140 1145 1150 

to 

Ala Pro Leu Val lie Thr Thr Arg Leu Asp Pro Arg Glu Val Asp Ser 
1155 1160 1165 

15 Glu Val His Asn Met Asp lie Val Arg Tyr Tyr Pro Leu Glu Phe Tyr 
1170 1175 1180 

Glu Ala Thr Tyr Glu Leu Lys Ser Pro Lys Glu Leu Val Gly Val He 
* 1185 1190 1195 1200 

Glu Arg Val Glu Asp Arg Leu Gly Lys Pro Glu Met Tyr Tyr Gly Leu 
1205 1210 1215 

Lys Phe Thr His Asp Thr Asp Asp lie Ala Leu Gly Pro Lys Met Ser 
1220 1225 1230 

n 

Leu Tyr Lys Gin Leu Gly Asp Met Glu Glu Lys Val Lys Arg Gin Leu 
1235 1240 1245 
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Asp Val Ala Arg Arg lie Arg Ala Val Asp Glu His Lys Val Ala Glu 
1250 1255 1260 

Thr lie Leu Asn Ser His Leu He Pro Asp Leu Arg Gly Asn Leu Arg 
1265 1270 1275 1280 

Ser Phe Thr Arg Gin Glu Phe Arg Cys Val Lys Cys Asn Thr Lys Phe 
1285 1290 1295 

Arg Arg Pro Pro Leu Asp Gly Lys Cys Pro lie Cys Gly Gly Lys He 
1300 1305 1310 

Val Leu Thr Val Ser Lys Gly Ala lie Glu Lys Tyr Leu Gly Thr Ala 
1315 1320 1325 

Lys Met Leu Val Thr Glu Tyr Lys Val Lys Asn Tyr Thr Arg Gin Arg 
1 330 1 335 1 340 

•w. 

lie Cys Leu Thr Glu Arg Asp lie Asp Ser Leu Phe Glu Ihr Val Phe 
1345 1350 1355 1360 

Pro Glu Thr Gin Leu Thr Leu Leu Val Asn Pro Asn Asp lie Cys Gin 
1365 1370 1375 

Arg lie He Met Glu Arg Thr Gly Gly Ser Lys Lys Ser Gly Leu Leu 
1380 1385 1390 

Glu Asn Phe Ala Asn Gly Tyr Asn Lys Gly Lys Lys Glu Glu Met Pro 
1395 1400 1405 

Lys Lys Gin Arg Lys Lys Glu Gin Glu Lys Ser Lys Lys Arg Lys Val 
1410 1415 1420 

lie Ser Leu Asp Asp Phe Phe Ser Arg Lys 
1425 1430 

(2) ANGABEN ZU SEQ ID NO: 30: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 1092 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 

(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Methanobacterium thermoautotrophicum 
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(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 30: 

^ Met Met Asp Tyr Phe Asn Glu Leu Glu Arg Glu Thr Glu Arg Leu Tyr 
1 5 10 15 

Glu He Ala Arg Lys Ala Arg Ala Arg Gly Leu Asp Val Ser Thr Thr 
,u 20 25 30 

Pro Glu lie Pro Leu Ala Lys Asp Leu Ala Glu Arg Val Glu Gly Leu 
35 40 45 

15 

Val Gly Pro Glu Gly lie Ala Arg Arg lie Lys Glu Leu Glu Gly Asp 
50 55 60 

20 Arg Gly Arg Glu Glu Val Ala Phe Gin lie Ala Ala Glu lie Ala Ser 
65 70 75 80 

Gin Ala Val Pro Asp Asp Asp Pro Glu Glu Arg Glu Lys Leu Ala Asp 
85 90 95 ; 

Gin Ala Leu Arg Thr Ala Leu Ala lie Leu Thr Glu Gly Val Val Ala 
30 100 105 110 

Ala Pro Leu Glu Gly lie Ala Arg Val Arg lie Lys Glu Asn Phe Asp 
115 120 125 

35 

Lys Ser Arg Tyr Leu Ala Val Tyr Phe Ala Gly Pro lie Arg Ser Ala 
130 135 140 

40 Gly Gly Thr Ala Ala Ala Leu Ser Val Leu He Ala Asp Tyr He Arg 
145 150 155 160 

Leu Ala Val Gly Leu Asp Arg Tyr Lys Pro Val Glu Arg Glu He Glu 
45 165 170 175 

Arg Tyr Val Glu Glu Val Glu Leu Tyr Glu Ser Glu Val Thr Asn Leu 
180 185 190 

50 

Gin Tyr Ser Pro Lys Pro Asp Glu Val Arg Leu Ala Ala Ser Lys lie 
195 200 205 

Pro Val Glu Val Thr Gly Glu Pro Thr Asp Lys Val Glu Val Ser His 
210 215 220 

^ Arg Asp Leu Glu Arg Val Glu Thr Asn Asn Me Arg Gly Gly Ala Leu 
225 230 235 240 
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Leu Ala Met Val Glu Gly Val lie Gin Lys Ala Pro Lys Val Leu Lys 
245 250 255 

Tyr Ala Lys Gin Leu Lys Leu Glu Gly Trp Asp Trp Leu Glu Lys Phe 
260 265 270 

Ser Lys Ala Pro Lys Lys Gly Glu Gly Glu Glu Lys Val Val Val Lys 
275 280 285 

Ala Asp Ser Lys Tyr Val Glu Asp lie lie Gly Gly Arg Pro Val Leu 
290 295 300 

Ala Tyr Pro Ser Glu Lys Gly Ala Phe Arg Leu Arg Tyr Gly Arg Ala 
305 310 315 320 

Arg Asn Thr Gly Leu Ala Ala Met Gly Val His Pro Ala Thr Met Glu 
325 330 335 

Leu Leu Gin Phe Leu Ala Val Gly Thr Gin Met Lys lie Glu A/g Pro 
340 345 350 

Gly Lys Gly Asn Cys Val Val Pro Val Asp Thr lie Asp Gly Pro Val 
355 360 365 

Val Lys Leu Arg Asn Gly Asp Val He Arg lie Glu Asp Ala Glu Thr 
370 375 380 

Ala Ser Arg Val Arg Ser Glu Val Glu Glu lie Leu Phe Leu Gly Asp 
385 390 395 400 

Met Leu Val Ala Phe Gly Glu Phe Leu Arg Asn Asn His Val Leu Met 
405 410 415 

Pro Ala Gly Trp Cys Glu Glu Trp Trp lie Gin Thr lie Leu Ser Ser 
420 425 430 

Pro Lys Tyr Pro Gly Asp Asp Pro Leu Asn Leu Ser Tyr Tyr Arg Thr 
435 440 445 

Arg Trp Asn Glu Leu Glu Val Ser Ala Gly Asp Ala Phe Arg lie Ser 
450 455 460 

Glu Glu Tyr Asp Val. Pro Leu His Pro Arg Tyr Thr Tyr Phe Tyr His 
465 470 475 480 

Asp Val Thr Val Arg Glu Leu Asn Met Leu Arg Glu Trp Leu Asn Thr 
485 490 495 
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Ser Gin Leu Glu Asp Glu Leu Val Leu Glu Leu Arg Pro Glu Lys Arg 
500 505 510 

He Leu Glu lie Leu Gly Val Pro His Arg Val Lys Asp Ser Arg Val 
515 520 525 

Val lie Gly His Asp Asp Ala His Ala Leu lie Lys Thr Leu Arg Lys 
530 535 540 

Pro Leu Glu Asp Ser Ser Asp Thr Val Glu Ala Leu Asn Arg Val Ser 
15 545 550 555 560 

Pro Val Arg He Met Lys Lys Ala Pro Thr Tyr lie Gly Thr Arg Val 
565 570 575 

20 

Gly Arg Pro Glu Lys Thr Lys Glu Arg Lys Met Arg Pro Ala Pro His 
580 585 590 

Val Leu Phe Pro lie Gly Lys Tyr Gly Gly Ser Arg Arg Asn llf Pro 
595 600 605 

^ Asp Ala Ala Lys Lys Gly Ser lie Thr Val Glu lie Gly Arg Ala Thr 
610 615 620 

Cys Pro Ser Cys Arg Val Ser Ser Met Gin Ser lie Cys Pro Ser Cys 
35 625 630 635 640 

Gly Ser Arg Thr Val lie Gly Glu Pro Gly Lys Arg Asn lie Asn Leu 
645 650 655 

40 

Ala Ala Leu Leu Lys Arg Ala Ala Glu Asn Val Ser Val Arg Lys Leu 
660 665 670 

45 Asp Glu lie Lys Gly Val Glu Gly Met lie Ser Ala Glu Lys Phe Pro 
675 680 685 

Glu Pro Leu Glu Lys Gly He Leu Arg Ala Lys Asn Asp Val Tyr Thr 
50 690 695 700 

Phe Lys Asp Ala Thr lie Arg His Asp Ser Thr Asp Leu Pro Leu Thr 
705 710 715 720 

55 

His Phe Thr Pro Arg Glu Val Gly Val Ser Val Glu Arg Leu Arg Glu 
725 730 735 

60 

Leu Gly Tyr Thr Arg Asp Cys Tyr Gly Asp Glu Leu Glu Asp Glu Asp 
740 745 750 
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Gin lie Leu Glu Leu Arg Vai Gin Asp Val Val lie Ser Glu Asp Cys 
755 760 765 

Ala Asp Tyr Leu Val Arg Val Ala Asn Phe Val Asp Asp Leu Leu Glu 
770 775 780 

Arg Phe Tyr Asp Leu Glu Arg Phe Tyr Asn Val Lys Thr Arg Glu Asp 
785 790 795 800 

Leu Val Gly His Leu lie Ala Gly Leu Ala Pro His Thr Ser Ala Ala 
805 810 815 

Val Leu Gly Arg lie lie Gly Phe Thr Gly Ala Ser Ala Cys Tyr Ala 
820 825 830 

His Pro Tyr Phe His Ser Ala Lys Arg Arg Asn Cys Asp Ser Asp Glu 
835 840 845 

Asp Ser Val Met Leu Leu Leu Asp Ala Leu Leu Asn Phe S^r Lys Ser 
850 855 860 

Tyr Leu Pro Ser Ser Arg Gly Gly Ser Met Asp Ala Pro Leu Val Leu 
865 870 875 880 

Ser Thr Arg He Asp Pro Glu Glu lie Asp Asp Glu Ser His Asn He 
885 890 895 

Asp Thr Met Asp Met lie Pro Leu Glu Val Tyr Glu Arg Ser Phe Asp 
900 905 910 

His Pro Arg Pro Ser Glu Val Leu Asp Val He Asp Asn Val Glu Lys 
915 920 925 

Arg Leu Gly Lys Pro Glu Gin Tyr Thr Gly Leu Met Phe Ser His Asn 
930 935 940 

Thr Ser Arg He Asp Glu Gly Pro Lys Val Cys Leu Tyr Lys Leu Leu 
945 950 955 960 

Pro Thr Met Lys Glu Lys Val Glu Ser Gin He Thr Leu Ala Glu Lys 
965 970 975 

He Arg Ala Val Asp Gin Arg Ser Val Val Glu Gly Val Leu Met Ser 
980 985 990 

His Phe Leu Pro Asp Met Met Gly Asn lie Arg Ala Phe Ser Arg Gin 
995 1000 1005 
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Lys Val Arg Cys Thr Lys Cys Asn Arg Lys Tyr Arg Arg lie Pro Leu 
1010 1015 1020 

Ser Gly Glu Cys Arg Cys Gly Gly Asn Leu Val Leu Thr Val Ser Lys 
1025 1030 1035 1040 

Gly Ser Val lie Lys Tyr Leu Glu lie Ser Lys Glu Leu Ala Ser Arg 
1045 1050 1055 

Tyr Pro lie Asp Pro Tyr Leu Met Gin Arg lie Glu lie Leu Glu Tyr 
1060 1065 1070 

Gly Val Asn Ser Leu Phe Glu Ser Asp Arg Ser Lys Gin Ser Ser Leu 
1075 1080 1085 

Asp Val Phe Leu 
1090 

(2) ANGABEN ZU SEQ ID NO: 31: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 1263 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 
ID) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 

(vi) URSPRUNLICHE HERKUNFT: 

(A) ORGANISMUS: Pyrococcus furiosus 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 31: 

Met Glu Leu Pro Lys Glu He Glu Glu Tyr Phe Glu Met Leu Gin Arg 
15 10 15 

Glu lie Asp Lys Ala Tyr Glu lie Ala Lys Lys Ala Arg Ser Gin Gly 
20 25 30 

Lys Asp Pro Ser Thr Asp Val Glu He Pro Gin Ala Thr Asp Met Ala 
35 40 45 

Gly Arg Val Glu Ser Leu Val Gly Pro Pro Gly Val Ala Gin Arg lie 
50 55 60 
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Arg Glu Leu Leu Lys Glu Tyr Asp Lys Glu lie Val Ala Leu Lys lie 
65 70 75 80 

Val Asp Glu lie lie Glu Gly Lys Phe Gly Asp Phe Gly Ser Lys Glu ' 
85 90 95 

Lys Tyr Ala Glu Gin Ala Val Arg Thr Ala Leu Ala He Leu Thr Glu 
100 105 110 

Gly lie Val Ser Ala Pro Leu Glu Gly lie Ala Asp Val Lys lie Lys 

115 120 125 ,5 

Arg Asn Thr Trp Ala Asp Asn Ser Glu Tyr Leu Ala Leu Tyr Tyr Ala 
130 135 140 

20 

Gly Pro lie Arg Ser Ser Gly Gly Thr Ala Gin Ala Leu Ser Val Leu 
145 150 155 160 

Val Gly Asp Tyr Val Arg Arg Lys Leu Gly Leu Asp Arg Phi Lys Pro 
165 170 175 

Ser Gly Lys His lie Glu Arg Met Val Glu Glu Val Asp Leu Tyr His 

180 185 190 ^ 

Arg Ala Val Ser Arg Leu Gin Tyr His Pro Ser Pro Asp Glu Val Arg 
195 200 205 

35 

Leu Ala Met Arg Asn lie Pro lie Glu lie Thr Gly Glu Ala Thr Asp 
210 215 220 

40 

Asp Val Glu Val Ser His Arg Asp Val Glu Gly Val Glu Thr Asn Gin 
225 230 235 240 

Leu Arg Gly Gly Ala lie Leu Val Leu Ala Glu Gly Val Leu Gin Lys 45 
245 250 255 

Ala Lys Lys Leu Val Lys Tyr He Asp Lys Met Gly lie Asp Gly Trp 

260 265 270 so 

Glu Trp Leu Lys Glu Phe Val Glu Ala Lys Glu Lys Gly Glu Glu lie 
275 280 285 

55 

Glu Glu Ser Glu Ser Lys Ala Glu Glu Ser Lys Val Glu Thr Arg Val 
290 295 300 

Glu Val Glu Lys Gly Phe Tyr Tyr Lys Leu Tyr Glu Lys Phe Arg Ala 
305 310 315 320 
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Glu lie Ala Pro Ser Glu Lys Tyr Ala Lys Glu lie lie Gly Gly Arg 
325 330 335 

Pro Leu Phe Ala Gly Pro Ser Glu Asn Gly Gly Phe Arg Leu Arg Tyr 
340 345 350 



10 



Gly Arg Ser Arg Val Ser Gly Phe Ala Thr Trp Ser lie Asn Pro Ala 
355 360 365 



15 



Thr Met Val Leu Val Asp Glu Phe Leu Ala lie Gly Thr Gin Met Lys 
370 375 380 



20 



Thr Glu Arg Pro Gly Lys Gly Ala Val Val Thr Pro Ala Thr Thr Ala 
385 390 395 400 

Glu Gly Pro lie Val Lys Leu Lys Asp Gly Ser Val Val Arg Val Asp 
405 410 415 



25 



Asp Tyr Asn Leu Ala Leu Lys lie Arg Asp Glu Val Glu Glu ^e Leu 
420 425 430 



30 



Tyr Leu Gly Asp Ala lie lie Ala Phe Gly Asp Phe Val Glu Asn Asn 
435 440 445 



35 



Gin Thr Leu Leu Pro Ala Asn Tyr Val Glu Glu Trp Trp lie Gin Glu 
450 455 460 



Phe Val Lys Ala Val Asn Glu Ala Tyr Glu Val Glu Leu Arg Pro Phe 
465 470 475 480 

40 

Glu Glu Asn Pro Arg Glu Ser Val Glu Glu Ala Ala Glu Tyr Leu Glu 
485 490 495 

45 Val Asp Pro Glu Phe Leu Ala Lys Met Leu Tyr Asp Pro Leu Arg Val 
500 505 510 



50 



Lys Pro Pro Val Glu Leu Ala lie His Phe Ser Glu lie Leu Glu lie 
515 520 525 



55 



60 



Pro Leu His Pro Tyr Tyr Thr Leu Tyr Trp Asn Thr Val Asn Pro Lys 
530 535 540 

Asp Val Glu Arg Leu Trp Gly Val Leu Lys Asp Lys Ala Thr lie Glu 
545 550 555 560 

Trp Gly Thr Phe Arg Gly lie Lys Phe Ala Lys Lys lie Glu lie Ser 
565 570 575 
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Leu Asp Asp Leu Gly Ser Leu Lys Arg Thr Leu Glu Leu Leu Gly Leu 
580 585 590 

Pro His Thr Val Arg Glu Gly lie Val Val Val Asp Tyr Pro Trp Ser 
595 600 605 

Ala Ala Leu Leu Thr Pro Leu Gly Asn Leu Glu Trp Glu Phe Lys Ala 
610 615 620 

Lys Pro Phe Tyr Thr Val lie Asp lie lie Asn Glu Asn Asn Gin lie 
625 630 635 640 

Lys Leu Arg Asp Arg Gly lie Ser Trp lie Gly Ala Arg Met Gly Arg 
645 650 655 

Pro Glu Lys Ala Lys Glu Arg Lys Met Lys Pro Pro Val Gin Val Leu 
660 665 670 

Phe Pro lie Gly Leu Ala Gly Gly Ser Ser Arg Asp lie Lys l^s Ala 
675 680 685 

Ala Glu Glu Gly Lys lie Ala Glu Val Glu He Ala Phe Phe Lys Cys 
690 695 700 

Pro Lys Cys Gly His Val Gly Pro Glu Thr Leu Cys Pro Glu Cys Gly 
705 710 715 720 

lie Arg Lys Glu Leu lie Trp Thr Cys Pro Lys Cys Gly Ala Glu Tyr 
725 730 735 

Thr Asn Ser Gin Ala Glu Gly Tyr Ser Tyr Ser Cys Pro Lys Cys Asn 
740 745 750 

Val Lys Leu Lys Pro Phe Thr Lys Arg Lys lie Lys Pro Ser Glu Leu 
755 760 765 

Leu Asn Arg Ala Met Glu Asn Val Lys Val Tyr Gly Val Asp Lys Leu 
770 775 780 

Lys Gly Val Met Gly Met Thr Ser Gly Trp Lys lie Ala Glu Pro Leu 
785 790 795 800 

Glu Lys Gly Leu Leu Arg Ala Lys Asn Glu Val Tyr Val Phe Lys Asp 
805 810 815 

Gly Thr Me Arg Phe Asp Ala Thr Asp Ala Pro lie Thr His Phe Arg 
820 825 830 
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Pro Arg Glu lie Gly Val Ser Val Glu Lys Leu Arg Glu Leu Gly Tyr 
835 840 845 

Thr His Asp Phe Glu Gly Lys Pro Leu Val Ser Glu Asp Gin lie Val 
850 855 860 

Glu Leu Lys Pro Gin Asp Val lie Leu Ser Lys Glu Ala Gly Lys Tyr 
865 870 875 880 

Leu Leu Arg Val Ala Arg Phe Val Asp Asp Leu Leu Glu Lys Phe Tyr 
885 890 895 

Gly Leu Pro Arg Phe Tyr Asn Ala Glu Lys Met Glu Asp Leu lie Gly 
900 905 910 

His Leu Val lie Gly Leu Ala Pro His Thr Ser Ala Gly He Val Gly 
915 920 925 

Arg lie lie Gly Phe Val Asp Ala Leu Val Gly Tyr Ala His P^ro Tyr 
930 935 940 

Phe His Ala Ala Lys Arg Arg Asn Cys Asp Gly Asp Glu Asp Ser Val 
945 950 955 960 

Met Leu Leu Leu Asp Ala Leu Leu Asn Phe Ser Arg Tyr Tyr Leu Pro 
965 970 975 

Glu Lys Arg Gly Gly Lys Met Asp Ala Pro Leu Val lie Thr Thr Arg 
980 985 990 

Leu Asp Pro Arg Glu Val Asp Ser Glu Val His Asn Met Asp Val Val 
995 1000 1005 

Arg Tyr Tyr Pro Leu Glu Phe Tyr Glu Ala Thr Tyr Glu Leu Lys Ser 
1010 1015 1020 

Pro Lys Glu Leu Val Arg Val lie Glu Gly Val Glu Asp Arg Leu Gly 
1025 1030 1035 1040 

Lys Pro Glu Met Tyr Tyr Gly lie Lys Phe Thr His Asp Thr Asp Asp 
1045 1050 1055 

lie Ala Leu Gly Pro Lys Met Ser Leu Tyr Lys Gin Leu Gly Asp Met 
1060 1065 1070 

Glu Glu Lys Val Lys Arg Gin Leu Thr Leu Ala Glu Arg lie Arg Ala 
1075 1080 1085 
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Val Asp Gin His Tyr Val Ala Glu Thr lie Leu Asn Ser His Leu lie 
1090 1095 1100 

Pro Asp Leu Arg Gly Asn Leu Arg Ser Phe Thr Arg Gin Glu Phe Arg 
1105 1110 1115 1120 

Cys Val Lys Cys Asn Thr Lys Tyr Arg Arg Pro Pro Leu Asp Gly Lys 
1125 1130 1135 

Cys Pro Val Cys Gly Gly Lys lie Val Leu Thr Val Ser Lys Gly Ala 
1140 1145 1150 

lie Glu Lys Tyr Leu Gly Thr Ala Lys Met Leu Val Ala Asn Tyr Asn 
1155 1160 1165 

Val Lys Pro Tyr Thr Arg Gin Arg He Cys Leu Thr Glu Lys Asp He 
1170 1175 1180 

Asp Ser Leu Phe Glu Tyr Leu Phe Pro Glu Ala Gin Leu Thr Leu He 
1185 1190 1195 1206'- 

Val Asp Pro Asn Asp He Cys Met Lys Met lie Lys Glu Arg Thr Gly 
1205 1210 1215 

Glu Thr Val Gin Gly Gly Leu Leu Glu Asn Phe Asn Ser Ser Gly Asn 
1220 1225 1230 

Asn Gly Lys Lys lie Glu Lys Lys Glu Lys Lys Ala Lys Glu Lys Pro 
1235 1240 1245 

Lys Lys Lys Lys Val lie Ser Leu Asp Asp Phe Phe Ser Lys Arg 
1 250 1 255 1 260 

(2) ANGABEN 2U SEQ ID NO: 32: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 363 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKOLS: Protein 

(vi) URSPRUNLICHE HERKUNFT: 

(A) ORGANISMUS: Homo sapiens 
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(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 32: 



Met Gin Ala Phe Leu Lys Gly Thr Ser lie Ser Thr Lys Pro Pro Leu 
1 5 10 15 

Thr Lys Asp Arg Gly Val Ala Ala Ser Ala Gly Ser Ser Gly Glu Asn 
20 25 30 

Lys Lys Ala Lys Pro Val Pro Trp Val Glu Lys Tyr Arg Pro Lys Cys 
35 40 45 

Val Asp Glu Val Ala Phe Gin Glu Glu Val Val Ala Val Leu Lys Lys 
50 55 60 

Ser Leu Glu Gly Ala Asp Leu Pro Asn Leu Leu Phe Tyr Gly Pro Pro 
65 70 75 80 

Gly Thr Gly Lys Thr Ser Thr lie Leu Ala Ala Ala Arg Glu Let* Phe 
85 90 95 

Gly Pro Glu Leu Phe Arg Leu Arg Val Leu Glu Leu Asn Ala Ser Asp 
100 105 110 

Glu Arg Gly lie Gin Val Val Arg Glu Lys Val Lys Asn Phe Ala Gin 
115 120 125 

Leu Thr Val Ser Gly Ser Arg Ser Asp Gly Lys Pro Cys Pro Pro Phe 
130 135 140 

Lys lie Val lie Leu Asp Glu Ala Asp Ser Met Thr Ser Ala Ala Gin 
145 150 155 160 

Ala Ala Leu Arg Arg Thr Met Glu Lys Glu Ser Lys Thr Thr Arg Phe 
165 170 175 

Cys Leu He Cys Asn Tyr Val Ser Arg lie lie Glu Pro Leu Thr Ser 
180 185 190 

Arg Cys Ser Lys Phe Arg Phe Lys Pro Leu Ser Asp Lys lie Gin Gin 
195 200 205 

Gin Arg Leu Leu Asp He Ala Lys Lys Glu Asn Val Lys lie Ser Asp 
210 215 220 

Glu Gly lie Ala Tyr Leu Val Lys Val Set Glu Gly Asp Leu Arg Lys 
225 230 235 240 
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Ala lie Thr Phe Leu Gin Ser Ala Thr Arg Leu Thr Gly Gly Lys Glu 
245 250 255 

He Thr Glu Lys Val lie Thr Asp lie Ala Gly Val lie Pro Ala Glu 
260 265 270 

Lys lie Asp Gly Val Phe Ala Ala Cys Gin Ser Gly Ser Phe Asp Lys 
275 280 285 

Leu Glu Ala Val Val Lys Asp Leu lie Asp Glu Gly His Ala Ala Thr 
290 295 300 

Gin Leu Val Asn Gin Leu His Asp Val Val Val Glu Asn Asn Leu Ser 
305 310 315 320 

Asp Lys Gin Lys Ser lie lie Thr Glu Lys Leu Ala Glu Val Asp Lys 
325 330 335 

Cys Leu Ala Asp Gly Ala Asp Glu His Leu Gin Leu He $er Leu Cys 
340 345 350 

Ala Thr Val Met Gin Gin Leu Ser Gin Asn Cys 
355 360 

(2) ANGABEN ZU SEQ ID NO: 33: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 329 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 

(vi) URSPRUNLICHE HERKUNFT: 

(A) ORGANISMUS: Homo sapiens 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 33: 

Asn Leu Val Gin Cys Gly Asp Phe Pro His Leu Leu Val Tyr Gly Pro 
1 5 10 15 



Ser Gly Ala Gly Lys Lys Thr Arg lie Met Cys lie Leu Arg Glu Leu 
20 25 30 
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Tyr Gly Val Gly Val Glu Lys Leu Arg lie Glu His Gin Thr lie Thr 
35 40 45 

Thr Pro Ser Lys Lys Lys lie Glu lie Ser Thr lie Ala Ser Asn Tyr 
50 55 60 



lU 



His Leu Glu Val Asn Pro Ser Asp Ala Gly Asn Ser Asp Arg Val Val 
65 70 75 80 



IS 



lie Gin Glu Met Leu Lys Thr Val Ala Gin Ser Gin Gin Leu Glu Thr 
85 90 95 



20 



Asn Ser Gin Arg Asp Phe Lys Val Val Leu Leu Thr Glu Val Asp Lys 
100 105 110 

Leu Thr Lys Asp Ala Gin His Ala Leu Arg Arg Thr Met Glu Lys Tyr 
115 120 125 



25 



Met Ser Thr Cys Arg Leu lie Leu Cys Cys Asn Ser Thr Ser^Lys Val 
130 135 140 V 



30 



lie Pro Pro lie Arg Ser Arg Cys Leu Ala Val Arg Val Pro Ala Pro 
145 150 155 160 



35 



Ser lie Glu Asp lie Cys His Val Leu Ser Thr Val Cys Lys Lys Glu 
1 65 1 70 1 75 



Gly Leu Asn Leu Pro Ser Gin Leu Ala His Arg Leu Ala Glu Lys Ser 
180 185 190 

40 

Cys Arg Asn Leu Arg Lys Ala Leu Leu Met Cys Glu Ala Cys Arg Val 
195 200 205 

45 Gin Gin Tyr Pro Phe Thr Ala Asp Gin Glu lie Pro Glu Thr Asp Trp 
210 215 220 



50 



Glu Val Tyr Leu Arg Glu Thr Ala Asn Ala lie Val Ser Gin Gin Thr 
225 230 235 240 



55 



60 



Pro Gin Arg Leu Leu Glu Val Arg Gly Arg Leu Tyr Glu Leu Leu Thr 
245 250 255 

His Cys lie Pro Pro Glu lie lie Met Lys Gly Leu Leu Ser Glu Leu 
260 265 270 

Leu His Asn Cys Asp Gly Gin Leu Lys Gly Glu Val Ala Gin Met Ala 
275 280 285 
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Ala Tyr Tyr Glu His Arg Leu Gin Leu Gly Ser Lys Ala lie Tyr His 
290 295 300 

Leu Glu Ala Phe Val Ala Lys Phe Met Ala Leu Tyr Lys Lys Phe lie 
305 310 315 320 

Gin Asp Gly Leu Glu Gly Met Met Phe 
325 

(2) ANGABEN ZU SEQ ID NO: 34: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 354 Aminosauren 

(B) ART: Aminosaure 

(C) STRANG FORM: Einzelstrang 

(D) TOPOLOGIE: linear 

{ID ART DES MOLEKULS: Protein 

(vi) URSPRUNLICHE HERKUNFT: 

(A) ORGANISMUS: Homo sapiens 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 34: 

Met Glu Val Glu Ala Val Cys Gly Gly Ala Gly Glu Val Glu Ala Gin 
15 10 15 

Asp Ser Asp Pro Ala Pro Ala Phe Ser Lys Ala Pro Gly Ser Ala Gly 
20 25 30 

His Tyr Glu Leu Pro Trp Val Glu Lys Tyr Arg Pro Val Lys Leu Asn 
35 40 45 

Glu He Val Gly Asn Glu Asp Thr Val Ser Arg Leu Glu Val Phe Ala 
50 55 60 

Arg Glu Gly Asn Val Pro Asn lie lie lie Ala Gly Pro Pro Gly Thr 
65 70 75 80 

Gly Lys Thr Thr Ser lie Leu Cys Leu Ala Arg Ala Leu Leu Gly Pro 
85 90 95 

Ala Leu Lys Asp Ala Met Leu Glu Leu Asn Ala Ser Asn Asp Arg Gly 
100 105 110 
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lie Asp Val Val Arg Asn Lys lie Lys Met Phe Ala Gin Gin Lys Val 
115 120 125 

Thr Leu Pro Lys Gly Arg His Lys lie lie lie Leu Asp Glu Ala Asp 
130 135 140 

Ser Met Thr Asp Gly Ala Gin Gin Ala Leu Arg Arg Thr Met Glu lie 
145 150 155 160 

Tyr Ser Lys Thr Thr Arg Phe Ala Leu Ala Cys Asn Ala Ser Asp Lys 
165 170 175 

lie lie Glu Pro lie Gin Ser Arg Cys Ala Val Leu Arg Tyr Thr Lys 
180 185 190 

Leu Thr Asp Ala Gin lie Leu Thr Arg Leu Met Asn Val lie Glu Lys 
195 200 205 

Glu Arg Val Pro Tyr Thr Asp Asp Gly Leu Glu Ala lie lie P^e Thr 
210 215 220 

Ala Gin Gly Asp Met Arg Gin Ala Leu Asn Asn Leu Gin Ser Thr Phe 
225 230 235 240 

Ser Gly Phe Gly Phe lie Asn Ser Glu Asn Val Phe Lys Val Cys Asp 
245 250 255 

Glu Pro His Pro Leu Leu Val Lys Glu Met lie Gin His Cys Val Asn 
260 265 270 

Ala Asn lie Asp Glu Ala Tyr Lys lie Leu Ala His Leu Trp His Leu 
275 280 285 

Gly Tyr Ser Pro Glu Asp lie lie Gly Asn Me Phe Arg Val Cys Lys 
290 295 300 

Thr Phe Gin Met Ala Glu Tyr Leu Lys Leu Glu Phe lie Lys Glu lie 
305 310 315 320 

Gly Tyr Thr His Met Lys lie Ala Glu Gly Val Asn Ser Leu Leu Gin 
325 330 335 

Met Ala Gly Leu Leu Ala Arg Leu Cys Gin Lys Thr Met Ala Pro Val 
340 345 350 

Ala Ser 
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(2) ANGABEN ZU SEQ ID NO: 35: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 366 Aminosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 

(vi) URSPRUNLICHE HERKUNFT: 

(A) ORGANISMUS: Escherichia coli 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 35: 
Met Lys Phe Thr Val Glu Arg Glu His Leu Leu Lys Pro beu Gin Gin 

15 10 15 

Val Ser Gly Pro Leu Gly Gly Arg Pro Thr Leu Pro lie Leu Gly Asn 
20 25 30 

Leu Leu Leu Gin Val Ala Asp Gly Thr Leu Ser Leu Thr Gly Thr Asp 
35 40 45 

Leu Glu Met Glu Met Val Ala Arg Val Ala Leu Val Gin Pro His Glu 
50 55 60 

Pro Gly Ala Thr Thr Val Pro Ala Arg Lys Phe Phe Asp lie Cys Arg 
65 70 75 80 

Gly Leu Pro Glu Gly Ala Glu lie Ala Val Gin Leu Glu Gly Glu Arg 
85 90 95 

Met Leu Val Arg Ser Gly Arg Ser Arg Phe Ser Leu Ser Thr Leu Pro 
100 105 110 

Ala Ala Asp Phe Pro Asn Leu Asp Asp Trp Gin Ser Glu Val Glu Phe 
115 120 125 

Thr Leu Pro Gin Ala Thr Met Lys Arg Leu lie Glu Ala Thr Gin Phe 
130 135 140 

Ser Met Ala His Gin Asp Val Arg Tyr Tyr Leu Asn Gly Met Leu Phe 
145 150 155 160 
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Glu Thr Glu Gly Glu Glu Leu Arg Thr Val Ala Thr Asp Gly His Arg 
165 170 175 

Leu Ala Val Cys Ser Met Pro lie Gly Gin Ser Leu Pro Ser His Ser 
180 185 190 

Val lie Val Pro Arg Lys Gly Val lie Glu Leu Met Arg Met Leu Asp 
195 200 205 

Gly Gly Asp Asn Pro Leu Arg Val Gin lie Gly Ser Asn Asn He Arg 
210 215 220 

Ala His Val Gly Asp Phe lie Phe Thr Ser Lys Leu Val Asp Gly Arg 
225 230 235 240 

Phe Pro Asp Tyr Arg Arg Val Leu Pro Lys Asn Pro Asp Lys His Leu 
245 250 255 

Glu Ala Gly Cys Asp Leu Leu Lys Gin Ala Phe Ala Arg; Ala Ala lie 
260 265 270 

Leu Ser Asn Glu Lys Phe Arg Gly Val Arg Leu Tyr Val Ser Glu Asn 
275 280 285 

Gin Leu Lys lie Thr Ala Asn Asn Pro Glu Gin Glu Glu Ala Glu Glu 
290 295 300 

He Leu Asp Val Thr Tyr Ser Gly Ala Glu Met Glu He Gly Phe Asn 
305 310 315 320 

Val Ser Tyr Val Leu Asp Val Leu Asn Ala Leu Lys Cys Glu Asn Val 
325 330 335 

Arg Met Met Leu Thr Asp Ser Val Ser Ser Val Gin lie Glu Asp Ala 
340 345 350 

Ala Ser Gin Ser Ala Ala Tyr Val Val Met Pro Met Arg Leu 
355 360 365 

(2) ANGABEN ZU SEQ ID NO: 36: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 363 Anr^inosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 
ID) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 
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(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Aquifex Aeolicus 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 36: 

Met Arg Val Lys Val Asp Arg Glu Glu Leu Glu Glu Val Leu Lys Lys 
15 10 15 

Ala Arg Glu Ser Thr Glu Lys Lys Ala Ala Leu Pro lie Leu Ala Asn 
20 25 30 

Phe Leu Leu Ser Ala Lys Glu Glu Asn Leu lie Val Arg Ala Thr Asp 
35 40 45 

Leu Glu Asn Tyr Leu Val Val Ser Val Lys Gly Glu Val Glu Glu Glu 
50 55 60 - 

Gly Glu Val Cys Val His Ser Gin Lys Leu Tyr Asp lie Valtys Asn 
65 70 75 80 

Leu Asn Ser Ala Tyr Val Tyr Leu His Thr Glu Gly Glu Lys Leu Val 
85 90 95 

lie Thr Gly Gly Lys Ser Thr Tyr Lys Leu Pro Thr Ala Pro Ala Glu 
100 105 110 

Asp Phe Pro Glu Phe Pro Glu lie Val Glu Gly Gly Glu Thr Leu Ser 
115 120 125 

Gly Asn Leu Leu Val Asn Gly lie Glu Lys Val Glu Tyr Ala lie Ala 
130 135 140 

Lys Glu Glu Ala Asn lie Ala Leu Gin Gly Met Tyr Leu Arg Gly Tyr 
145 150 155 160 

Glu Asp Arg lie His Phe Val Gly Ser Asp Gly His Arg Leu Ala Leu 
165 170 175 

Tyr Glu Pro Leu Gly Glu Phe Ser Lys Glu Leu Leu lie Pro Arg Lys 
180 185 190 

Ser Leu Lys Val Leu Lys Lys Leu lie Thr Gly lie Glu Asp Val Asn 
195 200 205 

lie Glu Lys Ser Glu Asp Glu Ser Phe Ala Tyr Phe Ser Thr Pro Glu 
210 215 220 
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Trp Lys Leu Ala Val Arg Leu Leu Glu Gly Glu Phe Pro Asp Tyr Met 
225 230 235 240 

Ser Val lie Pro Glu Glu Phe Ser Ala Glu Val Leu Phe Glu Thr Glu 
245 250 255 

Glu Val Leu Lys Val Leu Lys Arg Leu Lys Ala Leu Ser Glu Gly Lys 
260 265 270 

Val Phe Pro Val Lys lie Thr Leu Ser Glu Asn Leu Ala lie Phe Glu 
275 280 285 

Phe Ala Asp Pro Glu Phe Gly Glu Ala Arg Glu Glu lie Glu Val Glu 
290 295 300 

Tyr Thr Gly Glu Pro Phe Glu He Gly Phe Asn Gly Lys Tyr Leu Met 
305 310 315 320 

Glu Ala Leu Asp Ala Tyr Asp Ser Glu Arg Val Trp Phe J.ys Phe Thr 
325 330 335 V. 

Thr Pro Asp Thr Ala Thr Leu Leu Glu Ala Glu Asp Tyr Glu Lys Glu 
340 345 350 

Pro Tyr Lys Cys lie lie Met Pro Met Arg Val 
355 360 

(2) ANGABEN ZU SEQ ID NO: 37: 

(1) SEQUENZKENNZEICHEN: 

(A) LANGE: 1 1 60 Aminosauren 

(B) ART: Aminosaure 

(CI STRANGFORM: Einzelstrang 
(D) TOPOLOGIE: linear 

(ii) ART DES MOLEKULS: Protein 

(vi) URSPRUNLICHE HERKUNFT: 

(A) ORGANISMUS: Escherichia coli 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 37: 

Met Ser Glu Pro Arg Phe Val His Leu Arg Val His Ser Asp Tyr Ser 
15 10 15 
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Met lie Asp Gly Leu Ala Lys Thr Ala Pro Leu Val Lys Lys Ala Ala 
20 25 30 

Ala Leu Gly Met Pro Ala Leu Ala lie Thr Asp Phe Thr Asn Leu Cys 
35 40 45 

Gly Leu Val Lys Phe Tyr Gly Ala Gly His Gly Ala Gly lie Lys Pro 
50 55 60 

lie Val Gly Ala Asp Phe Asn Val Gin Cys Asp Leu Leu Gly Asp Glu 
65 70 75 80 

Leu Thr His Leu Thr Val Leu Ala Ala Asn Asn Thr Gly Tyr Gin Asn 
85 90 95 

Leu Thr Leu Leu lie Ser Lys Ala Tyr Glh Arg Gly Tyr Gly Ala Ala 
100 105 110 

Gly Pro He lie Asp Arg Asp Trp Leu He Glu Leu Asn Glu Gly Leu 
115 120 125 

He Leu Leu Ser Gly Gly Arg Met Gly Asp Val Gly Arg Ser Leu Leu 
130 135 140 

Arg Gly Asn Ser Ala Leu Val Asp Glu Cys Val Ala Phe Tyr Glu Glu 
145 150 155 160 

His Phe Pro Asp Arg Tyr Phe Leu Glu Leu He Arg Thr Gly Arg Pro 
165 170 175 

Asp Glu Glu Ser Tyr Leu His Ala Ala Val Glu Leu Ala Glu Ala Arg 
180 185 190 

Gly Leu Pro Val Val Ala Thr Asn Asp Val Arg Phe He Asp Ser Ser 
195 200 205 

Asp Phe Asp Ala His Glu lie Arg Val Ala lie His Asp Gly Phe Thr 
210 215 220 

Leu Asp Asp Pro Lys Arg Pro Arg Asn Tyr Ser Pro Gin Gin Tyr Met 
225 230 235 240 

Arg Ser Glu Glu Glu Met Cys Glu Leu Phe Ala Asp He Pro Glu Ala 
245 250 255 

Leu Ala Asn Thr Val Glu He Ala Lys Arg Cys Asn Val Thr Val Arg 
260 265 270 
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Leu Gly Glu Tyr Phe Leu Pro Gin Phe Pro Thr Gly Asp Met Ser Thr 
275 280 285 

Glu Asp Tyr Leu Val Lys Arg Ala Lys Glu Gly Leu Glu Glu Arg Leu 
290 295 300 

,0 Ala Phe Leu Phe Pro Asp Glu Glu Glu Arg Leu Lys Arg Arg Pro Glu 
305 310 315 320 

Tyr Asp Glu Arg Leu Glu Thr Glu Leu Gin Val lie Asn Gin Met Gly 
15 325 330 335 

Phe Pro Gly Tyr Phe Leu lie Val Met Glu Phe lie Gin Trp Ser Lys 
340 345 350 



20 



Asp Asn Gly Val Pro Val Gly Pro Gly Arg Gly Ser Gly Ala Gly Ser 
355 360 365 



25 



Leu Val Ala Tyr Ala Leu Lys lie Thr Asp Leu Asp Pro Leu Glu Phe 
370 375 380 



^ Asp Leu Leu Phe Glu Arg Phe Leu Asn Pro Glu Arg Val Ser Met Pro 
385 390 395 400 



35 



Asp Phe Asp Val Asp Phe Cys Met Glu Lys Arg Asp Gin Val lie Glu 
405 410 415 



His Val Ala Asp Met Tyr Gly Arg Asp Ala Val Ser Gin lie lie Thr 
420 425 430 

40 

Phe Gly Thr Met Ala Ala Lys Ala Val lie Arg Asp Val Gly Arg Val 
435 440 445 

45 Leu Gly His Pro Tyr Gly Phe Val Asp Arg lie Ser Lys Leu lie Pro 
450 455 460 

Pro Asp Pro Gly Met Thr Leu Ala Lys Ala Phe Glu Ala Glu Pro Gin 
^ 465 470 475 480 



55 



60 



Leu Pro Glu lie Tyr Glu Ala Asp Glu Glu Val Lys Ala Leu lie Asp 
485 490 495 

Met Ala Arg Lys Leu Glu Gly Val Thr Arg Asn Ala Gly Lys His Ala 
500 505 510 

Gly Gly Val Val lie Ala Pro Thr Lys lie Thr Asp Phe Ala Pro Leu 
515 520 525 
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Tyr Cys Asp Glu Glu Gly Lys His Pro Val Thr Gin Phe Asp Lys Ser 
530 535 540 

Asp Val Glu Tyr Ala Gly Leu Val Lys Phe Asp Phe Leu Gly Leu Arg 
545 550 555 560 

Thr Leu Thr lie lie Asn Trp Ala Leu Glu Met lie Asn Lys Arg Arg 
565 570 575 

Ala Lys Asn Gly Glu Pro Pro Leu Asp lie Ala Ala lie Pro Leu Asp 
580 585 590 

Asp Lys Lys Ser Phe Asp Met Leu Gin Arg Ser Glu Thr Thr Ala Val 
595 600 605 

Phe Gin Leu Glu Ser Arg Gly Met Lys Asp Leu lie Lys Arg Leu Gin 
610 615 620 

Pro Asp Cys Phe Glu Asp Met lie Ala Leu Val Ala Leu Phe^'Arg Pro 
625 630 635 640 

Gly Pro Leu Gin Ser Gly Met Val Asp Asn Phe lie Asp Arg Lys His 
645 650 655 

Gly Arg Glu Glu lie Ser Tyr Pro Asp Val Gin Trp Gin His Glu Ser 
660 665 670 

Leu Lys Pro Val Leu Glu Pro Thr Tyr Gly lie He Leu Tyr Gin Glu 
675 680 685 

Gin Val Met Gin lie Ala Gin Val Leu Ser Gly Tyr Thr Leu Gly Gly 
690 695 700 

Ala Asp Met Leu Arg Arg Ala Met Gly Lys Lys Lys Pro Glu Glu Met 
705 710 715 720 

Ala Lys Gin Arg Ser Val Phe Ala Glu Gly Ala Glu Lys Asn Gly lie 
725 730 735 

Asn Ala Glu Leu Ala Met Lys He Phe Asp Leu Val Glu Lys Phe Ala 
740 745 750 

Gly Tyr Gly Phe Asn Lys Ser His Ser Ala Ala Tyr Ala Leu Val Ser 
755 760 765 

Tyr Gin Thr Leu Trp Leu Lys Ala His Tyr Pro Ala Glu Phe Met Ala 
770 775 780 
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Ala Val Met Thr Ala Asp Met Asp Asn Thr Glu Lys Val Val Gly Leu 
785 790 795 800 

^ Val Asp Glu Cys Trp Arg Met Gly Leu Lys He Leu Pro Pro Asp lie 
805 810 815 

10 Asn Ser Gly Leu Tyr His Phe His Val Asn Asp Asp Gly Glu lie Val 
820 825 830 

Tyr Gly lie Gly Ala lie Lys Gly Val Gly Glu Gly Pro He Glu Ala 
15 835 840 845 

He lie Glu Ala Arg Asn Lys Gly Gly Tyr Phe Arg Glu Leu Phe Asp 
850 855 860 



20 



Leu Cys Ala Arg Thr Asp Thr Lys Lys Leu Asn Arg Arg Val Leu Glu 
865 870 875 880 



25 



Lys Leu lie Met Ser Gly Ala Phe Asp Arg Leu Gly Pro His ^rg Ala 
885 890 895 



^ Ala Leu Met Asn Ser Leu Gly Asp Ala Leu Lys Ala Ala Asp Gin His 
900 905 910 

Ala Lys Ala Glu Ala lie Gly Gin Ala Asp Met Phe Gly Val Leu Ala 
55 915 920 925 

Glu Glu Pro Glu Gin He Glu Gin Ser Tyr Ala Ser Cys Gin Pro Trp 
930 935 940 

40 

Pro Glu Gin Val Val Leu Asp Gly Glu Arg Glu Thr Leu Gly Leu Tyr 
945 950 955 960 

45 Leu Thr Gly His Pro He Asn Gin Tyr Leu Lys Glu He Glu Arg Tyr 
965 970 975 

Val Gly Gly Val Arg Leu Lys Asp Met His Pro Thr Glu Arg Gly Lys 
* 980 985 990 



55 



60 



Val He Thr Ala Ala Gly Leu Val Val Ala Ala Arg Val Met Val Thr 
995 1000 1005 

Lys Arg Gly Asn Arg lie Gly lie Cys Thr Leu Asp Asp Arg Ser Gly 
1010 1015 1020 

Arg Leu Glu Val Met Leu Phe Thr Asp Ala Leu Asp Lys Tyr Gin Gin 
1025 1030 1035 1040 
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Leu Leu Glu Lys Asp Arg lie Leu lie Val Ser Gly Gin Val Ser Phe 
1045 1050 1055 

Asp Asp Phe Ser Gly Gly Leu Lys Met Thr Ala Arg Glu Val Met Asp 
1060 1065 1070 

He Asp Glu Ala Arg Glu Lys Tyr Ala Arg Gly Leu Ala lie Ser Leu 
1075 1080 1085 

Thr Asp Arg Gin lie Asp Asp Gin Leu Leu Asn Arg Leu Arg Gin Ser 
1090 1095 1100 

Leu Glu Pro His Arg Ser Gly Thr He Pro Val His Leu Tyr Tyr Gin 
1105 1110 1115 1120 

Arg Ala Asp Ala Arg Ala Arg Leu Arg Phe Gly Ala Thr Trp Arg Val 
1125 1130 1135 

Ser Pro Ser Asp Arg Leu Leu Asn Asp Leu Arg Gly Leujile Gly Ser 
1140 1145 1150 

Glu Gin Val Glu Leu Glu Phe Asp 
1155 1160 

12) ANGABEN 2U SEQ ID NO: 38: 

(i) SEQUENZKENNZEICHEN: 

(A) LANGE: 1161 Anrjinosauren 

(B) ART: Aminosaure 

(C) STRANGFORM: Einzelstrang 

(D) TOPOLOGIE: linear 

(ii) ART DBS MOLEKULS: Protein 

(vi) URSPRONLICHE HERKUNFT: 

(A) ORGANISMUS: Aquifex Aeolicus 



(xi) SEQUENZBESCHREIBUNG: SEQ ID NO: 38: 

Met Ser Lys Asp Phe Val His Leu His Leu His Thr Gin Phe Ser Leu 
15 10 15 

Leu Asp Gly Ala lie Lys lie Asp Glu Leu Val Lys Lys Ala Lys Glu 
20 25 30 
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Tyr Gly Tyr Lys Ala Val Gly Met Ser Asp His Gly Asn Leu Phe Gly 
35 40 45 

5 

Ser Tyr Lys Phe Tyr Lys Ala Leu Lys Ala Glu Gly lie Lys Pro lie 
50 55 60 

"> He Gly Met Glu Ala Tyr Phe Thr Thr Gly Ser Arg Phe Asp Arg Lys 
65 70 75 80 

Thr Lys Thr Ser Glu Asp Asn lie Thr Asp Lys Tyr Asn His His Leu 
85 90 95 

lie Leu He Ala Lys Asp Asp Lys Gly Leu Lys Asn Leu Met Lys Leu 
20 100 105 110 

Ser Thr Leu Ala Tyr Lys Glu Gly Phe Tyr Tyr Lys Pro Arg He Asp 
115 120 125 

25 

Tyr Glu Leu Leu Glu Lys Tyr Gly Glu Gly Leu He Ala Leu Thf Ala 
130 135 140 

30 Cys Leu Lys Gly Val Pro Thr Tyr Tyr Ala Ser He Asn Glu Val Lys 
145 150 155 160 

Lys Ala Glu Glu Trp Val Lys Lys Phe Lys Asp He Phe Gly Asp Asp 
165 170 175 

Leu Tyr Leu Glu Leu Gin Ala Asn Asn lie Pro Glu Gin Glu Val Ala 
180 185 190 

40 

Asn Arg Asn Leu He Glu He Ala Lys Lys Tyr Asp Val Lys Leu lie 
195 200 205 

45 

Ala Thr Gin Asp Ala His Tyr Leu Asn Pro Glu Asp Arg Tyr Ala His 
210 215 220 

50 Thr Val Leu Met Ala Leu Gin Met Lys Lys Thr He His Glu Leu Ser 
225 230 235 240 

Ser Gly Asn Phe Lys Cys Ser Asn Glu Asp Leu His Phe Ala Pro Pro 
55 245 250 255 

Glu Tyr Met Trp Lys Lys Phe Glu Gly Lys Phe Glu Gly Trp Glu Lys 
260 265 270 

60 

Ala Leu Leu Asn Thr Leu Glu Val Met Glu Lys Thr Ala Asp Ser Phe 
275 280 285 
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Glu He Phe Glu Asn Ser Thr Tyr Leu Leu Pro Lys Tyr Asp Val Pro 
290 295 300 

Pro Asp Lys Thr Leu Glu Glu Tyr Leu Arg Glu Leu Ala Tyr Lys Gly 
305 310 315 320 

Leu Arg Gin Arg lie Glu Arg Gly Gin Ala Lys Asp Thr Lys Glu Tyr 
325 330 335 

Trp Glu Arg Leu Glu Tyr Glu Leu Glu Val lie Asn Lys Met Gly Phe 
340 345 350 



Ala Gly Tyr Phe Leu lie Val Gin Asp Phe lie Asn Trp Ala Lys Lys 
355 360 365 

Asn Asp lie Pro Val Gly Pro Gly Arg Gly Ser Ala Gly Gly Ser Leu 
370 375 380 

Val Ala Tyr Ala lie Gly lie Thr Asp Val Asp Pro He Lys His -Gly 
385 390 395 400 

Phe Leu Phe Glu Arg Phe Leu Asn Pro Glu Arg Val Ser Met Pro Asp 
405 410 415 

lie Asp Val Asp Phe Cys Gin Asp Asn Arg Glu Lys Val lie Glu Tyr 
420 425 430 

Val Arg Asn Lys Tyr Gly His Asp Asn Val Ala Gin He lie Thr Tyr 
435 440 445 

Asn Val Met Lys Ala Lys Gin Thr Leu Arg Asp Val Ala Arg Ala Met 
450 455 460 

Gly Leu Pro Tyr Ser Thr Ala Asp Lys Leu Ala Lys Leu He Pro Gin 
465 470 475 480 

Gly Asp Val Gin Gly Thr Trp Leu Ser Leu Glu Glu Met Tyr Lys Thr 
485 490 495 

Pro Val Glu Glu Leu Leu Gin Lys Tyr Gly Glu His Arg Thr Asp He 
500 505 510 

Glu Asp Asn Val Lys Lys Phe Arg Gin He Cys Glu Glu Ser Pro Glu 
515 520 525 

lie Lys Gin Leu Val Glu Thr Ala Leu Lys Leu Glu Gly Leu Thr Arg 
530 535 540 
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His Thr Set Leu His Ala Ala Gly Val Val lie Ala Pro Lys Pro Leu 
545 550 555 560 

Ser Glu Leu Val Pro Leu Tyr Tyr Asp Lys Glu Gly Glu Val Ala Thr 
565 570 575 



10 



Gin Tyr Asp Met Val Gin Leu Glu Glu Leu Gly Leu Leu Lys Met Asp 
580 585 590 



15 



Phe Leu Gly Leu Lys Thr Leu Thr Glu Leu Lys Leu Met Lys Glu Leu 
595 600 605 



20 



lie Lys Glu Arg His Gly Val Asp lie Asn Phe Leu Glu Leu Pro Leu 
610 615 620 

Asp Asp Pro Lys Val Tyr Lys Leu Leu Gin Glu Gly Lys Thr Thr Gly 
625 630 635 640 



25 



Val Phe Gin Leu Glu Ser Arg Gly Met Lys Glu Leu Leu LysJ-ys Leu 
645 650 655 



3^ Lys Pro Asp Ser Phe Asp Asp He Val Ala Val Leu Ala Leu Tyr Arg 
660 665 670 



35 



40 



Pro Gly Pro Leu Lys Ser Gly Leu Val Asp Thr Tyr He Lys Arg Lys 
675 680 685 

His Gly Lys Glu Pro Val Glu Tyr Pro Phe Pro Glu Leu Glu Pro Val 
690 695 700 

Leu Lys Glu Thr Tyr Gly Val lie Val Tyr Gin Glu Gin Val Met Lys 
705 710 715 720 



45 Met Ser Gin lie Leu Ser Gly Phe Thr Pro Gly Glu Ala Asp Thr Leu 
725 730 735 



50 



Arg Lys Ala lie Gly Lys Lys Lys Ala Asp Leu Met Ala Gin Met Lys 
740 745 750 



55 



60 



Asp Lys Phe lie Gin Gly Ala Val Glu Arg Gly Tyr Pro Glu Glu Lys 
755 760 765 

lie Arg Lys Leu Trp Glu Asp lie Glu Lys Phe Ala Ser Tyr Ser Phe 
770 775 780 

Asn Lys Ser His Ser Val Ala Tyr Gly Tyr lie Ser Tyr Trp Thr Ala 
785 790 795 800 



65 
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Tyr Val Lys Ala His Tyr Pro Ala Glu Phe Phe Ala Val Lys Leu Thr 
805 810 815 

Thr Glu Lys Asn Asp Asn Lys Phe Leu Asn Leu He Lys Asp Ala Lys ^ 
820 825 830 

Leu Phe Gly Phe Glu lie Leu Pro Pro Asp lie Asn Lys Ser Asp Val 

835 840 845 ^" 

Gly Phe Thr lie Glu Gly Glu Asn Arg lie Arg Phe Gly Leu Ala Arg 

850 855 860 i3 

lie Lys Gly Val Gly Glu Glu Thr Ala Lys lie lie Val Glu Ala Arg 
865 870 875 880 

20 

Lys Lys Tyr Lys Gin Phe Lys Gly Leu Ala Asp Phe lie Asn Lys Thr 
885 890 895 

Lys Asn Arg Lys lie Asn Lys Lys Val Val Glu Ala Leu Val l^ys Ala 25 
900 905 910 

Gly Ala Phe Asp Phe Thr Lys Lys Lys Arg Lys Glu Leu Leu Ala Lys 

915 920 925 ^ 

Val Ala Asn Ser Glu Lys Ala Leu Met Ala Thr Gin Asn Ser Leu Phe 
930 935 940 

35 

Gly Ala Pro Lys Glu Glu Val Glu Glu Leu Asp Pro Leu Lys Leu Glu 
945 950 955 960 

40 

Lys Glu Val Leu Gly Phe Tyr lie Ser Gly His Pro Leu Asp Asn Tyr 
965 970 975 

Glu Lys Leu Leu Lys Asn Arg Tyr Thr Pro lie Glu Asp Leu Glu Glu 45 
980 985 990 

Trp Asp Lys Glu Ser Glu Ala Val Leu Thr Gly Val lie Thr Glu Leu 

995 1000 1005 50 

Lys Val Lys Lys Thr Lys Asn Gly Asp Tyr Met Ala Val Phe Asn Leu 
1010 1015 1020 

55 

Val Asp Lys Thr Gly Leu lie Glu Cys Val Val Phe Pro Gly Val Tyr 
1025 1030 1035 1040 

Glu Glu Ala Lys Glu Leu lie Glu Glu Asp Arg Val Val Val Val Lys ^ 
1045 1050 1055 
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Gly Phe Leu Asp Glu Asp Leu Glu Thr Glu Asn Val Lys Phe Val Val 
1060 1065 1070 

Lys Glu Val Phe Ser Pro Glu Glu Phe Ala Lys Glu Met Arg Asn Thr 
1075 1080 1085 

Leu Tyr lie Phe Leu Lys Arg Glu Gin Ala Leu Asn Gly Val Ala Glu 
1090 1095 1100 

Lys Leu Lys Gly lie lie Glu Asn Asn Arg Thr Glu Asp Gly Tyr Asn 
1105 1110 1115 1120 

Leu Val Leu Thr Val Asp Leu Gly Asp Tyr Phe Val Asp Leu Ala Leu 
1125 1130 1135 

Pro Gin Asp Met Lys Leu Lys Ala Asp Arg Lys Val Val Glu Glu lie 
1140 1145 1150 

Glu Lys Leu Gly Val Lys Val He lie 
1155 1160 



Patentanspriiche 

1. Thcrnioslabilcr in viiro Komplex zur Template-abhangigen Elongation von Nukleinsauren, umfassend ein ther- 
mos! abilcs Cjlciiklaniiiicrprotein, welches mit einem thermostabilen Polymeraseaktivitat-aufweisenden Elongati- 
onsproicin vcrbunJen isl. 

2. TheniiosiabiicT Komplex nach Anspruch 1, dadurch gekennzeichnet, dass das Gleitklammerprotein und das 
Elongationsproiein iiber ein Kopplungsprotein verbunden sind. 

3. Themiosiabiler Komplex nach Anspruch 1 oder 2, dadurch gekennzeichnet, dass das Gleitklammerprotein und 
das Elongationsprotein aus Archaebakterien stamraen. 

4. Tbennostabiler Komplex nach Anspruch 1, 2 oder 3, dadurch gekennzeichnet, dass das Gleitklammerprotein 
eine ringfbniiige Struklur aufweist, welche die Template-Nukleinsaurestrange ganz oder teilweise umschlieBl. 

5. Thermos tabiJer Komplex nach einem der Anspriiche 1 bis 4, dadurch gekennzeichnet, dass das Gleitklammer- 
protein eine oder beide der folgenden Aminosaurekonsensussequenzen aufweist: 

SEQIDNO: 39 

[G A V L I M P F WQ-D-X-X-X-(G A V L I M P F \V]-X-X- [G A V L I M P F W]-X-[G A V L I M P F W]-X-[G A 
V L I M P F WJ-X-X-X-X-F-X-X-Y-X-X-D und/oder 
SEQIDNO: 40 

[G A V L I M P F W]-X(3)-L-A-P-[K R H D E]-[G A V L I M P F W]-E. 

6. Themiostabiler Komplex nach einem der Anspriiche 1 bis 5, dadurch gekennzeichnet, dass das Gleitklammer- 
protein zu der menschlichen (Eukaryonten) PCNA Aminosauresequenz (SEQ ID NO: 11) auf einer Lange von min- 
destens 100 Aminosauren bei einem Sequenzalignment eine mindestens 20%ige Sequenzidentitai aufweist und/ 
Oder das Gleitklammerprotein zu der bakterielle ^-clamp Sequenz aus E.coli (Eubakteria) (SEQ ID NO: 35) auf ei- 
ner Lange von mindestens 100 Aminosauren bei einem Sequenzalignment eine mindestens 20%5ge Sequenzidenti- 
tai aufweist und/oder das Gleitklammerprotein zu der Aminosauresequenz des PCNA Homologen aus Archaeoglo- 
bus Fulgidus (Archaea) (SEQ ID NO: 12) auf einer Lange von mindestens 100 Aminosauren bei einem Sequenza- 
lignment eine mindestens 20%ige Sequenzidentitai aufweist. 

7. Jliermostabiler Komplex nach einem der vorhergehenden Anspriiche, dadurch gekennzeichnet, dass das Gleit- 
klammerprotein mit einem aus einem Alignment aus Abb. 12 generienen Hidden Markow Modell einen Score von 
mindestens 20 ergibt und/oder wobei das Gleitklammerprotein mit einem aus einem Alignment aus Abb* 1 3 gene- 
rierten Hidden Markow Modell einen Score von mindestens 25 ergibt. 

8. Thermostabiler Komplex nach einem der vorhergehenden Anspriiche, dadurch gekennzeichnet^ dass das Gleit- 
klammerprotein ausgewahlt ist aus AF0335 aus Archaeoglobus Fulgidus, MJ0247 aus Methanococcus jannaschii, 
PHLA008 aus Pyrococcus horikoschii, MTH1312 aus Methanobacterium thermoautotrophicus sowie AE000761 7 
aus Aquifex aeolicus. 

9. Thermostabiler Komplex nach einem der vorhergehenden Anspriiche, dadurch gekennzeichnet, dass das Elon- 
gationsprotein eine 5 -3*-Polymeraseakdviiat oder eine Reverse-IYanskripiaseaktivitai aufweist. 

10. Thermostabiler Komplex nach Anspruch 9, dadurch gekennzeichnet, dass das Elongationsprotein mindestens 
eine der folgenden Konsensussequenzen beinhaltet und an nicht mehr als vier Positionen von dieser Sequenz ab- 
weicht: 
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SEQIDNO:44 

D-[GAVLIMPFW]-[GAVLIMPF W]-X-X-Y-N-X-X-X-F-D-X-P-Y-[G A V K U N O F W]-X-X-R-A 
SEQIDNO: 45 

A-[GAVLIMPFW]-R-T-A-[FA VLIMPF\Vn-A-[GAVLIMPFW]-[GAVLIMPFW]-T-E-G-[GAV 
L I M P F W]- V-X-A-P.[G A V L I M P F W]-E-G-I-A-X-V-[K R H D E]-I 5 
SEQIDNO: 46 

[G A V L I M P F W]-P-V-G-[G A V L I M P F W]^-R-G-S-X-[G A V L I M P F WJ-G-S-fG AVKUNOFW> 
V-A-X. A-(G A V L I M P F W].X-I-T-D-[G A V K U N O F W1-I>P-[G A V L I M P F W]-X-X-X-IG A V L I M P 
F WJ-L-F-E-R-KL-N-P-E-R-tG A V L 1 M P F Wj-S-M-P-D. 

11. Thermostabiler Komplex nach Anspnich 9, dadurch gekennzeichnet, dass das Elongaiionsprotein eine zu der lo 
menschlichen (Eukaryonten) Aminosaurcsequenz (SEQ ID NO: 22) auf einer Lange von mindestens 200 Amino- 
^uren bei einem Sequenzalignmenl eine mindestens 20%ige Sequenzidentitat aufweist und/oder zur archaebakte- 
riellen Aminosaunesequenz (SRQ TD NO: 27) auf einer T^nge von mindestens 400 Aminosauren bei einem Sequen- 
zalignment eine mindestens 25%ige Sequenzidentitat aufweist und/oder zur eubakteriellen Aminosauresequenz 
(SEQ ID NO: 37) auf dner Lange von mindestens 300 Aminosauren bei einem Sequenzalignment eine mindestens is 
25%ige Sequenzidentitat aufweist. 

12. Thermostabiler Komplex nach Anspnich 9, dadurch gekennzeichnet dass das Elongaiionsprotein mit einem 
aus einem Alignment aus Abb. 17 generierten Hidden Markow Model! einen Score von mindestens 20 eigibl und/ 
Oder wobei das Hongationsprotein mit einem aus einem Alignment aus Abb. 18 generierten Hidden Markow Mo- 
dell einen Score von mindestens 35 ergibt und/oder wobei das Elongaiionsprotein mit einem aus einem Alignment 20 
aus Abb. 19 generierten Hidden Markow Modell einen Score von mindestens 20 ergibL 

13. Thermostabiler Komplex nach einem der vorhergehenden Anspruche, dadurch gekennzeichnet, dass das Elon- 
gaiionsprotein ausgewahlt ist aus AF0497 oder AF1722 aus Archaoglobus fulgidus, MJ0885 oder MJ1630 aus Me- 
thanococcus jannaschii, PHBT047 oder PHBN021 aus Pyrococcus horikoschii, MTHUOS oder MTH1536 aus Me- 
thanobacierium thermoautotrophicus sowie PFU0RF3 aus Pyrococcus furiosus. ' " 25 

14. Thennosiabiler Komplex nach einem der vorhergehenden Anspruche, dadurch^ekennzeichnet, dass das Kopp- 
lungsprotcin die folgcndc Konscnsusscqucnz bcinhaltcl und an nicht mchr als vicr Positioncn von dicscr Scqucnz 
abweicht: 

(SEQIDNO: 43) 

[FLF[G A V L I M P F W]-X-X-IG A V L I M P F W]-X-G-X( 1 3)-[G A V L 1 M P F W]-X-[YR>[G A V L I M P F 30 
W]-X-1G A V L I M P F W]- A-G-[DN]-[G AVLIMPFW]-[GAVLIMPF W]-[DS]. 

1 5. Thermostabiler Komplex nach einem der vorhergehenden Anspruche, dadurch gekennzeichnet, dass das Kopp- 
lungsprotein zu der menschlichen (Eukaryonten) Aminosauresequenz (SEQ ID NO: 16) auf einer Lange von min- 
destens 150 Aminosauren bei einem Sequenzalignment eine mindestens 18%ige Sequenzidentitat aufweist 

1 6. Thermostabiler Komplex nach einem der vorheigehenden Anspruche, dadurch gekennzeichnet, dass das Kopp- 35 
lungsprotein mit einem aus einem Alignment aus Abb. 16 generierten Hidden Markow Modell einen Score von 
mindestens 10 ergibt 

1 7. Thermostabiler Komplex nach einem der vorheigehenden Anspruche, dadurch gekennzeichnet dass das Kopp- 
lungsprotein ausgewahlt ist aus AF1790 aus Archaeoglobus fulgidus, MJ0702 aus Methanococcus jannaschii, 
PHBN023 aus Pyrococcus horikoschii, MTH1405 aus Meihanobacterium thermoaulothrophicum sowie PFU0RF2 40 
aus Pyrococcus furiosus. 

1 8. Thermostabiler Komplex nach einem der vorhergehenden Anspruche, dadurch gekennzeichnet dass der Kom- 
plex mit einem Protein assoziiert ist welches als Gleitklammerlader fungiert. 

19. TTiermostabiler Komplex nach einem der vorhergehenden Anspruche, dadurch gekennzeichnet, dass der Kom- 
plex mit ATP Oder einem anderen Kofaktor assoziiert vorliegt 45 

20. Thermostabiler akzessorischer in vitro-Komplex, dadurch gekennzeichnet dass er ein Gleitklammerprotein 
und ein Kopplungsprolein enlhak, wie sie jeweils in einem der vorhergehenden Anspruche definiert sind. 

21. Rekombinante DNA-Sequenz, dadurch gekennzeichnet dass sie fiir einen thermostabilen in vitro-Komplex ge- 
mafi einem der Anspruche 1 bis 19 oder fiir einen thermostabilen akzessorischen Komplex nach Anspruch 20 co- 
dieri. 50 

22. Vektor, dadurch gekennzeichnet dass er eine fiir ein Gleitklammerprotein und ein Kopplungsprotein oder/und 
ein Elongaiionsprotein codierende, rekombinante DNA-Sequenz enihalt. 

23. Vektor nach Anspruch 22, dadurch gekennzeichnet dass er zusatzlich geeignete Restrikiionsschnittstellen zur 
Insertion weiterer DNA-Sequenzen in einer solchen Anordnung enthalt dass sich ein Fusionsprotein aus Gleitklam- 
merprotein und dem Expressionsprodukt der weiteren DNA-Sequenzen eigibt. 55 

24. Vektor nach Anspruch 22 oder 23, dadurch gekennzeichnet dass er geeignete; die Expression der DNA-Se- 
quenz(en) steuemde Promotor- oder/und Operatorbereiche enthalt 

25. Vektor nach Anspruch 24, dadurch gekennzeichnet dass er mehrere Promotor- oder/und Operatorbereiche zur 
getrennten Expression mehrerer DNA-Sequenzen enthalt. 

26. Vektor nach einem der Anspruche 22 bis 25, dadurch gekennzeichnet, dass er reprimier- und induzierbare Pro- m 
motor- oder/und Operatorbereiche enthalt. 

27. Vektor nach einem der Anspruche 22 bis 26, dadurch gekennzeichnet dass er eine DNA-Sequenz nach An- 
spruch 21 enthalt. 

28. Wirtszelle. dadurch gekennzeichnet dass sie mit einem oder mehreren Vektoren nach einem der Anspruche 22 

bis 27 u^nsformien ist. 55 

29. Verfahren zur Herstellung eines thermostabilen in vitro- Komplexes gemaB einem der Anspruche 1 bis 19 oder 
eines thermostabilen, akzessorischen in vitro-Komplexes gemaB Anspruch 20, dadurch gekennzeichnet dass man 
eine entsprechende rekombinante DNA-Sequenz gemaS Anspruch 21 oder einen oder mehrerc entsprechende \fek- 
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toren gemaB einem der Anspriiche 22 bis 27 in eine Wrtszelle einbringt, die Expression der Proteine bewirkt und 
diese aus dem Kulturmedium Oder nach Zellaufschluss isoliert und gegebenenfalls noch mil weiteren Komplexbe- 
standteilen koppelt. 

30. Verwendung eines thermostabilen, akzessorischen in vitro-Komplexes nach Anspruch 20 zur Herstellung eines 
thermostabilen in vitro-Komplexes nach einerh der Anspriiche 1 bis 19, dadurch gekennzeichnet, dass man den ak- 
zessorischen Komplex mil einem Polymeraseaktivitat aufweisenden Elongationsprotein koppelt. 

31. Verfahren zur Templaie-abhangigen Elongation von Nukleinsauren, wobei die Nukleinsaure notigenfalls dena- 
turiert, mil mindestens einem Primer unler Hybridisierungsbedingungen versehen wird, wobei der Primer komple- 
mentar zu einem flankierenden Bereich einer gewiinschten Nukleinsauresequenz des 'Ifemplate-Strangs ist und mit 
Hilfe einer Polymerase in Anwesenheil von Nukleotiden eine Primer-Elongation erfolgt, dadurch gekennzeichnet, 
dass als Polymerase ein thermostabiler in vitro-Komplex gemaB einem der Anspriiche 1 bis 19 verwendet wird. 

32. Verfahren nach Anspruch 31, dadurch gekennzeichnet, dass man zur Amplifikation von DNA-Sequenzen zwei 
die gewiinschte Nukleinsauresequenz flankierende Primer und Deoxynukleotide oder/und Derivate davon oder/und 
Ribonukleotide oder/und Derivate davon verwendet. 

33. Verfahren nach Anspruch 32, dadurch gekennzeichnet, dass man eine Polymerase-Ketten-Reaktion durchfuhrt. 

34. Verfahren nach Anspruch 31, dadurch gekennzeichnet, dass man zur Reversen-Transkription von RNA in DNA 
einen thermostabilen in viu-o-Kompiex gemaB einem der Anspriiche 1 bis 19 einsetzt, dessen Elongationsprotein 
Reverse-TVanskriptase-Aktivitataufweisi. 

35. Verfahren nach einem der Anspriiche 31 bis 34, dadurch gekennzeichnet, dass man zur Sequenaerung von Nu- 
kleinsauren ausgehend von einem Primer, der zu einem der zu sequenzierenden Nukleinsaure benachbarten Bereich 
koraplementar ist, eine Template-abhangige Elongation bzw. Reverse-Transkription unter Verwendung von Deox- 
ynukleotiden und Dideoxynukleotiden oder deren jeweiligen Derivaten gemaB der Methode von Sanger durchfuhrt. 

36. Verfahren nach einem der Anspriiche 31 bis 35, dadurch gekennzeichnet, dass man bei der Elongation der Nu- 
kleinsauren Markierungen einfiigt. 

37. Verfahren nach Anspruch 36, dadurch gekennzeichnet, dass man markierte Primer oder/und mariderte Deox- 
ynukleotide oder/und deren Derivate oder/und markierte Dideoxynukleolide (^^er/und deren Derivate oder/und 
markierte Ribonukleotide oder/und deren Derivate einsetzt. 

38. Verfahren zur Markierung von Nukleinsauren durch Erzeugung einzelner Briiche in Phosphodiesterbindungen 
der Nukleinsaurekette und Ersatz eines Nukleotids an den Bruchstellen durch ein markiertes Nukleoiid mit Hilfe ei- 
ner Polymerase, dadurch gekennzeichnet, dass als Polymerase ein thermostabiler in vitro-Komplex nach einem der 
Anspriiche 1 bis 1 9 eingesetzt wird. 

39. Reagenzien-Kii zur Elongation oder/und Amplifikation oder/und Reversen-'Mnskription oder/und Sequenzie- 
rung oder/und Markierung von Nukleinsauren, enthaltend in einem oder mehreren getrennten Behaltem 

a) einen tliermostabilen in vitro-Komplex gemaB einem der Anspriiche 1 bis 19 oder 

b) einen thcnifiostabilen akzessorischen in vitro-Komplex gemaB Anspruch 20 und gegebenenfalls separat da- 
von ein Polymeraseaktivitat aufweisendes Elongationsprotein, 

sowie gegebenenfalls Primer, Puffersubstanzen, Nukleotide, ATP, einen oder mehrere andere Kbfaktoren oder/und 
Pyrophosphat. 

40. Kit nach Anspruch 39, dadurch gekennzeichnet, dass er zur Amplifikation von Nukleinsauren neben den Sub- 
stanzen a) oder b), welche 5'-3'-Polymeraseakti vital aufweisen, Deoxynukleotide oder/und Derivate davon enthalt. 

41 . Kit nach Anspruch 39, dadurch gekennzeichnet, dass er zur Reversen Transkription Substanzen a) oder b) ent- 
halt, welche Reverse Transkriptase-Aktivitat aufweisen, sowie Deoxynukleotide oder/und Derivate davon. 

42. Kit nach einem der Anspriiche 39 bis 41, dadurch gekennzeichnet, dass er zur Sequenzierung neben Deoxynu- 
kleotiden oder Ribonukleotiden oder/und deren Derivaten, Dideoxynukleotide oder/und deren Derivate enthalt. 

43. Kit nach einem der Anspriiche 39 bis 42, dadurch gekennzeichnet, dass er Primer oder/und Deoxynukleotide 
oder/und Dideoxynukleolide in markierter Form enthalt. 



Hierzu 15 Seite(n) Zeichnungen 



19840771 A 1_L> 



136 



OOCID: <0E 19840771A1J, 



ZEICHNUNGEN SEITE 1 



Nummer: 
Int. CI 7: 

Offenlegungstag: 



DE 19840 771 A1 
C12N W12 

10. Februar2000 



C 

a 
2 

< 



iiii 
Slip 

< 0. b. 






! 


1 
1 

I 
j 

1 

i 

i 


fs4 W 2 






fn 2 

CO 

CU Nw-* 


§ I 

ill 

<Sp: 


-pi 

o — 
X ^ C 


2 

o ^ 
<^ X 


in 

6 

sis 


o 

v*^ JrT z 
S 2 Q 

^ s 


^0 

d 

«> S ^ 

X ^5 


O 

O X 
\0 rri ^ 

— D 

w-> — X 

X C 


= Sis 
i SS B 
< (Cs b 


6 

rsi ^ ^ 

^ 
^ ox bd 


- S 
o ;;^x 

S ^ O' 
r* rx a> 


O 

T wn 

in 

CU ' 


„ 6 
^ - z 

iT' r>4 CD 
P>« 


cs 
6 

c 

^ rs» c/) 

CU 4 


5\ 

6 

^ ^z 

ar W 

^ W5 


Ardiacon 

Mcthanococcus 

Jaiinasdui 


(N — ■ ^ 

n cs W 


00* 

d 

2S ^ C 


^6 

cs S 
^ C 


6 

o — 

ry 


rx 
6 

2 

oo 

oo vo 
§»« 


Se* 
§2 

X o ^ 

2 *Q, SI 


J 

i 8-i 5 1 
^ ^ f 3 t 




6 

:: 

s: g; w 


»ri fs W 
C ^ C 


6 
cz 

O M S 

<7w — ^ 

j:; ^ o 

S w 


6 

a\ — ^ 

^ ^ c 
< ci^^ 


6 

2 

€N 2 

^ S 


Homo snpicns 


2 ^ 5 *^ 5 ^ 
1 c> i ©2 6 

X O X g X 9 s o 
< i2.< 52.< £^< 52. 


X 2 
»^ o* 
u w 


2: ^ 
i 6 

k!££. 


z^ 

ii. 

if 


< 

5 6 

is 

DS2. 






8 

1 a 
§ § s 

iff 

Ego 
U \3i ^ 


t 

b 
c 

3 - 


b 
c 

ii 


b 
c 

5 

*D 


•§•§ 
^5 


•Ii 

111 5^ 


a< 



902 066/569 



SDOCIE>: <DE 19840771A1 J_> 



ZEICHNUNGENSEITE2 



Nummer: 
Int C\J: 

Offenlegungstag: 



DE 19840 771 A1 
C12N S/12 

10. Februar2000 



Abbildung 2: 



Koziq>ODeDten des 

EloDgatioQS- 

holoenzymes 


Eubakteria 
Escheridiia coli 


Aquifex Aeolicus 
(%Idemitat | 
Aligomentlange)* 


Glettklaimner 


DP3B_EC0LI 
(SEQ ID NO J5] 


AASEQ93 
(28% 1370) 
(SEQ ID NO:36] 


EioDgaiionsprot ein 


DP3A_EC0LI 
(SEQ ID NO:371 


AASEQ50 
(37% 11154) 
|SEQ ID NO:381 
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ZEICHNUNGENSEITE3 



Nummen 
Int. C17: 

Offenlegungstag: 



DE 19840771 A1 
C 12 N 9/12 

10. Februar 2000 




902 066/569 



SDOCID: <DE 1 9840771 A1J_> 



ZEICHNUNGENSEniE4 



Mummer: 
Int Cl.^: 

Offenlegungstag: 



DE19840 771A1 
C12N 9/12 

10. Februar2000 



Abbi Idling 4 ; 



Gleitklammer Region 1: 



PCNA_HUMAN 
PCNA_METJA 



MDSSPIVSLVQLTLRSEGFDTYRCD 
MDPSHVALVSLEI PRLAFEEYEAD 
LDRSHITYVHLELKAELFDEYVCD 
MDPSRWLIDLNLPSSIFSKYEVD 
VDPANVAMVIVDI PKDSFEVYNID 



MTH1312 
PHLA008 
AF0335 



Konsensus : $DXXX$XX$X$X$XXXXFXXYXXD 

Pattern : [GAVLIMPFW] -D-X-X-X- [GAVLIMPFW] -X-X- [GAVLIMPFW] -X- 
[GAVLIMPFW] -X" [GAVLIMPFW] -X-X-X-X-F-X-X-Y-X-X-D 



Gleitklammer Region 2: 

PCNA_HUMAN : LKYYLAPKIE 

PCNA_METJA : LTFLLAPRIE 

MTH1312 : LSFLLAPRIE 

PHLA008 : LTFLLAPRVE 

AF0335 : VEYILAPRIE 

Konsensus : $XXXLAP£c$E 

Pattern : [GAVLIMPFW] -X ( 3 ) -L-A-P- [KRHDE] - [GAVLIMPFW] -E 



aXXJJD: <0E t9e40771Al_L> 
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2EICHNUNGEN SEITE5 Nummer DE 19840 771 A1 

lnt.Cl7: C12NS/12 

Offenlegungstag: 10. Februar 2000 

Albbildunq 5t 



Koppelnde Untereinheit : 



PHBN023 

PfuORF2 

DPD2_HUMAN. 

MTH1405 

AF1790 

MJ0702 



FLEWLNGYVESKEEEEI VSRIRYLI lAGDWD 
FLEWLNGNVETKEEEEIVSRVKYLIIAGDWD 
LVDWTGQLGDEGEQCSAAHVSRVILAGNLLS 
FVKWINGDFGSEEQRSLAADVKYLWAGDIVD 
FVRWLKGEVGGKKSQNLAEKVKYIVIAGDIVD 
FIRFLNGDVDNELEEKWSRLKYICIAGDLVD 



Konsensus : F$XX$XGXXXXXXXXXXXXX$XY$X$AGDS$D 
L R N S 



Pattern : [FL] - [GAVLIMPFW] -X-X- tGAVLIMPFW] -X-G-X (13) - [GAVLIMPFW] -X- 
t YR] - [GAVLIMPFW] -X- [GAVLIMPFW] -A-G- [DN] - [GAVLIMPFW] - [GAVLIMPFW] - [DS] 



AbbildTina 6; 



GleitJclainmerlader: 



AC11_HUMAN 

AF2060 

MTH0241 

PHBN012 

MJ1422 

Konsensus 



CNYLSKI IPALQSRCTRFRFGPL 
CNYVSRI lEPIQSRCAVFRFKPV 
CNYSSKIIDPIQSRCAIFRFLPL 
CNYSSKIIEPIQSRCAIFRFRPL 
CNYPSKIIPPIQSRCAVFRFSPL 

CNYXS &! IX $ $QSRCXXFRFXP$ 



Pattern : C-N-Y-X-S- [KRHDE] -I-I-X- [GAVLIMPFW] 
X-X-F-R-F-X-P- [GAVLIMPFW) 



[GAVLIMPFW] -Q-S-R-C- 



3EXX:tD: <DE ie840771A1_L> 
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ZEICHNUNGEN SEITE 6 



Nummer: 
Int. C\7: 

Offenlegungstag: 



DE 19840 771 A1 
C12N 9^12 

10. Februar 2000 



Abbildimg 7 : 



Gleitklammerlader 2: 



AC15_HUMAN 

MJ0884 

AF1195 

MTO0240 

PHBN013 

Konsensus 

Pattern 
(GAVLIMPFWl 



KAALLSGPPGVGKTTTASLV 
KPILLVGPPGCGKTTLAYAL 
KPLLLAGPPGVGKTSLALAL 
KPLLLVGPPGTGKTTLAHI I 
KALLLAGPPGSGKTTTVYAL 

KXXLLXGPPGXGKT§X$XX$ 

K-X-X-L-L-X-G-P-P-G-X-G-K-T- [STNQYC] -X- [GAVLIMPFW] -X-X- 



Abbildung 8 : 



Elongationprotein 1 



DPOD_HUMAN 

DPOL_ARCFU 

MTH1208 

PHBT047 

MJ0885 

Konsensus 



DVITGYNIQNFDLPYLISRA 
DI I VGYNQDAFDWP YLRKRA 
DILVGYNSDNFDFPYITRRA 
DVI ITYNGDNFDFPYLLKRA 
DVIYTYNGDNFDFPYLKARA 

D$$XXYNXXXFDXPY$XXRA 



Pattern ; D- (GAVLIMPFW) - [GAVLIMPFW) -X-X-Y-N-X-X-X-F-D-X-P-Y- 

[ GAVLIMPFW] -X-X-R-A 



30OCK): <DE 1964077 tAl_r_> 
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ZEICHNUNGENSEITE7 



Nummer 
Int. C\7: 

Offenlegungstag: 



DE 19840 771 A1 

C12N S/12 

10. Februar2000 



Abbilduna 9i 



Elongationsprotein 2 



AF1722 

MJ1630 

PfuORF3 

MTH1536 

PHBN021 



AVRTAVAIMTEGWAAPIEGIARVRI 
AVRTALAVLTEGIVAAPLEGIADVKI 
AVRTALAILTEGIVSAPLEGIADVKI 
ALRTALAILTEGWAAPLEGIARVRI 
AVRTALAILTEGWS AP lEGI ASVKI 



Konsensus: A$RTA$A$$TEG$VXAP$EGIAXV&I 

Pattern : A- [GAVLIMPFW] -R-T-A- [GAVLIMPFWJ -A- [GAVLIMPFW] - f GAVLIMPFW] 
T-E-G- [GAVLIMPFW] -V-X-A-P- (GAVLIMPFW] -E-G-I-A-X-V- [KRHDE] ~I 



Abbildxing 10 1 



Elongationsprotein (Eubakteria) 

DP3A.EC0LI : VPVGPGRGSGAGSLVAYALKITDLDPLEFDLLFERFLNPERVSMPD 
DP3A_SALTY : VPVGPGRGSGAGSLVAYALKITDLDPLEFDLLFERFLNPERVSMPD 
BB0579 : IPVGAGRGSGAGSIVAYALRITDIDPLKYNLLFERFLNPERISMPD 

DP3A_HELPY : IPVGPGRGSAAGSLVAFALKITDIDPLKYDLLFERFLNPERISMPD 
AA50 : IPVGPGRGSAGGSLVAYAIGITDVDPIKHGFLFERFLNPERVSMPD 

Konsensus : $PVGSGRGSX$GS$VAXA$XITD$DP$XXX$LFERFLNPER$SMPD 

Pattern : [GAVLIMPFW] -P-V-G- [GAVLIMPFW] -G-R-G-S-X- [GAVLIMPFW] -G-S- 
[GAVLIMPFW] -V-A-X-A- [GAVLIMPFW] -X-I-T-D- [GAVLIMPFW] -D-P- [GAVLIMPFW] -X- 
X-X" (GAVLIMPFW] -L-F-E-R-F-L-N-P-E-R- I GAVLIMPFW] -S-M-P-D 



5DOCID: <DE 19840771A1_L> 
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ZEICHNUNGEN SEITE 8 



Mummer: 
Int C\7: 

Offenlegungstag: 



DE19840771A1 
C 12 N 0^12 

10. Februar2000 



Abbildung 11: 



Gleitklammer (Eubakteria) Region 1: 



AAP0L3B 

DP3B_EC0LI 

S.TYPHIM. 

DP3B_PR0MI 

DP3B_PSEPU 

DP3B_STRC0 

Konsensus 

Pattern 



KLVITGGKSTYKLPTAPAEDFP 
RMLVRSGRSRFSLSTLPAADFP 
RMLVRSGRSRFSLSTLPAADFP 
RLLVRSGRSRFSLSTLPASDFP 
KLLVKAGRSRFTLSTLPANDFP 
RATWCGSSRFTLHTLPVEEYP 

&XX$XXGXSXXXLXT$P$X&XP 

[KRHDE] -X"X- [GAVLIMPFWJ -X-X-G-X-S-X-X-X-L-X-T- 



[GAVLIMPFW] -P- [GAVLIMPFW] -X- [KRHDE] -X-P 



Gleitklammer (Eubakteria) Region 2 



AAPOL3B 

DP3B_EC0LI 

S.TYPHIM. 

DP3B_PR0MI 

DP3B_PSEPU 

DP3B_STRC0 

Konsensus 

Pattern 



IIMPMRV 
WMPMRL 
WMPMRL 
WMPMRL 
WMPMRL 
LIMPVRL 

$$MP$R$ 

[GAVLIMPFW) - [GAVLIMP™] -M-P- [GAVLIMPFW] -R- [GAVLIMPFW] 



30CX10: <DE 19e4a77tAl J_> 
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ZEICHNUNGENSEITE9 Mummer 



DE 19840 771 A1 
lntCl7: C12N$^12 
Offenlegungstag: 10. Feb mar 2000 



Abbildung 12: 



PCNA„HUMAN 

PCNA^METJA 

MTH1312 

PHLA008 

AF033S 



PCNA.HUMAN 

PCNA_METJA 

MTH1312 

PHLA008 

AF0335 



PCNA.HUMAN 

PCNA_METJA 

MTH1312 

PHLA008 

AF0335 



PCNA_HUMAN 

PCNA_METJA 

MTH1312 

PHLA008 

AF0335 



* 20 * 40 * 

LINEACVroiSSSGVNLQSiyroSSHVSLVQLTLRSEXSFDTYRCDRNI^ 
LUDEICFEVDEroiKASAMDPSHVALVSLEIPRIAFEEYEADS-HDIGIDLEAFKKVM 
IVDEVQIQLSAEGLRLDALDRSHITYraLElJCAELFDEyVCDEPERIl^^ 
LIDEAAFKOTEEGISMRAMDPSRVVLIDI^PSSIFSKYE\TOEETIGVN^ 
LVSEARIHFLEKGLHSRAVDPANVAMVIVDIPKDSFEVYNIDEEKTIGVDMDRIFDIS 

60 * 80 * 100 * 

KCAGNEDIITIJlAEDNADTIJa.VFEAPNQEKVSDyEMKIJ©LDV^ IPEQEYSC 

NRAKAKDRLILELDEEKKKLNVIFENTG KRKFSIALLDISASSVK- -VPEIEYPN 

KRAKANDRVILSTDEG-N-LUQFEGEA— VRTFKIRLIDIEYETPS--PPEIEyEN 

KRGKAKDTLILRKGEE-NFLEISLQGTA TRTFRLPLIDVEEIBVE--LPDLPYTA 

KSISTKDLVELIVEDE-STLKVKFGSVE YKVALIDPSAIRKEPRIPELELPA 

120 * 140 * 160 ' * 

WKMPSGEFARICRDLSHIGDAWISCAKDGVKFSASGELGNGNIKLSQTSNVDKEEE 

VIMIKGDAFKEALKDADLFSDWILKADEDKFVIHAKGDIJaENEAIFEKDSSA 

EFEVPFQLLKDSIADIDIFSDKITFRVDEDRFIASAEGEFGDAQIEY 

KVVVLGEVIJCEAVKDASLVSDSIKFMAKENEFIMRAEGETQEV?^^ 

KIVMDAGEFKKAIAAADKISDQVIFRSDKEGFRIEAKGDVDSIVFHMT- ET 

180 * 200 * ""220 * 

AVTIEMNEPVQLTFALRYIJ«FFTKATPLSST7TLSMSADVPLVVEYKIADM-GHr^^ 
1 1 SLEVKEEAKSAFNLDYIJ^DMWGVSSGDIIKI YLGM:)MPLKLEYS lAG - - VNL 
LHGERIDKPARSIYSIX^KIKEKILKADKFSETAIINLGDDMPLKLTLKMASK^ 
LLDIEVQEETKSAYGVSYU^)MVKGIGKADE\m![RFGNEMPMQMEyyira 
ELIEFNGGEARSMFSVDYLKEFCKVAGSGDLLTIHLGTlSrrPVRLWELVGGRAKVEyi 



58 
57 
58 
58 
58 



114 
110 
109 
110 
109 



172 
163 
156 
163 
159 



229 
219 
214 
220 
217 



PCNA.HUMAN • 


: LAPKIED : 


' 236 


PCNA_METJA • 


' LAPRIEG 


: 226 


MTH1312 


: LAPRIEA 


221 


PHLA008 


: LAPRA^E 


. 227 


AF0335 


: LAPRIES 


, 224 



JDOCID: <DE 1 9840771 A1J_> 
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2EICHNUNGEN SEfTE 10 



Nummer: 
Int. 07: 

Offenlegungstag; 



DE 19840 771 A1 
C12N $t^2 

10. Februar2(X)0 



Abbildimg 13: 



* 20 * 40 * 

AAP0L3B : MRVKVDREEI^EVUOCARESTEKKAALPIUO^LLSAKEEISa^ 

DP3B_EC0LI : MKFTVEREHLIJa>LQQVSGPLGGRPTLPIIiGm,LLQVAIX3TLSLTCTDLEME^ 

S . TYPHIM . : MKFTVEREHLUCPLQQVSGPIXSGRPTLPILGNLLLQVADGTLSL^ 

DP3B_PR0MI : MKFIIEREQLUCPLQQVSGPl^GRPTLPIUSNLLIJCVTENTLSLTGTO^ 

DP3B_PSEPU : MHFTIQREALIJCPLQLVAGVVERRQTLPVLSNVLLWQGQQLSLTGTDLE<^ 

DP3B_STRC0 : MKIRVERDVIAEAVAWAARSLPARPPAPVIJlGLUJCAEEGQLSLSSro 



58 
58 
58 
58 
58 
58 



60 * 80 * 100 * 

AAP0L3B : GEVEEE-GEVCVHSQKLYDIVKNL-NSAYVYLHTEGEKLVITGGKSTYKLPTAPAm : 114 

DP3B_EC0LI : LVQPHEPGATIVPARKFFDICRGLPEGAEIAVQLEGERMLVRSGRSRFSLSTLPAADF : 116 

S. TYPHIM. : LSQPHEPGATTVPARKFFDICRGLPEGAEIAVQLEGDRMLVRSGRSRFSLSTLPAADF : 116 

DP3B_PR0MI : LSQSHEIGATTVPARKFFDIWRGLPEGAEISVEUXSDRLLVRSGRSRFSLSTLPASDF : 116 

DP3B_PSEPU : LEEPAEPGEIWPARKLlffilCKSLPNDALIDIKVDEQKLLVKAGRSRFTLSTLPANDF : 116 

DP3B_STRC0 : AEIEEE-GT\a.VSGRLUU5ISRAL-PNRPVEISTDGVRAT\AACGSSRFTLHTLPVEEY : 114 



120 * 140 * 160^ * 

AAP0L3B : PEFPEIVE-GGETLSGNLLWGIEKVEYAIAKEEANIALQGMyLRGYEDRIHFVGSDG : 171 

DP3B_EC0LI : PNU^DWQSEVEFTLPQATMKRLIEATQFSMAHQDVRYYU^GMryETEGEE^ : 174 

S. TYPHIM- : PNLDDWQSEVEFTLPQATMKRLIESTQFSMAHQDVRYYIiNGMLFETEGSELRTVATre : 174 

DP3B_PR0MI : PNU)DWQSEVEFTLPQATIJaU.IESTQFSMAHQDVRYYLNGMLFETENTEliRTV^ ; 174 

DP3B_PSEPU : PTTOEGPGSLTCNLEQSKLRRLIERTSFAMAQQDTOYYIJ^JGMLLEVSRNTU^ : 174 

DP3B_STRC0 : PALPQMPE-ATGTVPGEVFASAVQQVAIAAGRDDTLPVLTGVRIEIEGDSVTLASTDR : 171 



AAP0L3B 

DP3B_EC0LI 

S . TYPHIM . 

DP3B„PR0MI 

DP3B_PSEPU 

DP3B_STRC0 



180 * 200 * 220 * 

HRLALYEPLGEFSKEL LIPRKSLKVLKKLITGIEDVNIEKSE— DESFAYFS : 221 

HRIJ^VCSMPIGQSLPS--HSVIVPRKGVIEU4W4UX;-GDNPlJlVQI---GSmiRM : 226 

HRLAVCSMPLEASLPS--HSVIVPRKGVIELMRMLr)G-GENPLRVQI GSmiRAH : 226 

HRIAVCAMDIGQSLPG--HSVTVPRKGVIELMRLLDGSGESLLQLQI GSNNLRAH : 227 

HRLALCSMSAPIEQEDR-HQVIVPRKGILELARLLTD-PEGMVSIVL GQHHIRAT : 227 

YRFAVREFLWKPENPDISAVALVPAKTLQDTAKALTSGDQVILALSGSGAGEGLIGFE : 229 



240 * 260 * 280 * 

AAP0L3B : TPEWKIAVRIJ:-EGEFPDYMSVIPEEFSAEVLFETEEVLK\njaU.KALSEGKVFP^ : 279 

DP3B_EC0LI : VGDFIFTSKLV1X5RFPDYRRVLPKNPDKHLEAGCDLLKQAFARAAILSNEKFRGVRLY : 284 

S. TYPHIM. : VGDFIFTSKLVDGRFPDYRRVLPKNPDKHLEAGCDILKQAFARAAILSNEKFRGV^ : 284 

DP3B_PR0MI : VGDFIFTSKLVDGRFPDYRRVLPKOTTKTVIAGCDILKQAFSRAAILSNEKFRGVRIN : 285 

DP3B„PSEPU : TGEFTFTSKLVDGKFPDYERVLPKGGDKLWGDRQALREAFSRTAILSNEKYRGIRLQ : 285 

DP3B_STRC0 : GAGRRTTTRLLEGDLPKYKTLFPTEFNSVAVIETAPFVEAVKRVALVA-ERNTFVRLS : 286 



300 * 320 * 340 

AAP0L3B : LSENIAIFEFADPEFGEAREEIE\^YTGEPFEIGFNGKY1J4EALDAYDSERVWFKFT^ 

DP3B_EC0LI : VSENQUaTANNPEQEEAEEIU^\nTSGAEMEIGFWSYVL0VimLKCEN^ 

S . TYPHIM . : VSENQUCITAI^EQEEAEEIU:)VSYGGTE^lEIGFWSYVLDVl^UCCET^mI^ 

DP3B_PR0MI : LTNGQIJCITANNPEQEEAEErTOVQYQGEEMEIGFNVSYLLDVIJ»rrLKCEE\^ 

DP3B„PSEPU : LAAGQUCIQANNPEQEEAEEEISVDYEGSSLEIGFNVSYLLDVLGVMTTEQV^ 

DP3B_STRC0 : FEQGVLILEAGSSDDAQAVERVDAQLEGDDISIAFNPTFLLDGLSAIDSPVAQLSFTT 



337 
341 
341 
342 
342 
344 



* 360 * 380 

AAP0L3B : PDTATLLEAEDYEK-EPYKCIIMPMRV-- 

DP3B_EC0LI : DSVSSVQIEDAAS-QSAAYWMPMRL— 

S. TYPHIM. : DSVSSVQIEDAAS-QSAAYWMPMRL-- 

DP3B_PR0MI : DAVSSVQVENVAS- AAAAYWMPMRL- - 

DP3B_PSEPU : DSNSSALLQEAGN-DDSSYWMPMRL-- 

DP3B_STRC0 : STKPALLSGRPAVDAEADEAYKYLIMPVRLSG 



363 
366 
366 
367 
367 
376 



902 06^9 



.19e<0771A1_L> 



ZEICHNUNGEN SEITE 1 1 N^rnmer DE 19840 771 Al 

lnt.Cl7: C12N 9/12 

Offenlegungstag: 10. Februar 2000 

Abbildung 14: 



* 20 * 40 * 60 

ACll.HUMAN : ICNYLSKIIPALQSRCTRFRFGPLTPEI^PRLEHVVEEEKVDISEDGM : 60 

AF2060 : SCNWSRIIEPIQSRCAVFimCPVPKEAMKKRLLEICEKEGVKITi^ : 60 

MTH0241 : SClTirSSKIIDPIQSRCAIFRFLPLKGHQIIKRLEYIAEKENLEYEAHALETI^^ : 60 

PHHN012 : SCNYSSKIIEPIQSRCAIFRFRPLRDEDIAKRLRYIAENEGLELTEEGLQAILYIAEG : 60 

MJ1422 : NCNYPSKIIPPIQSRCAWRFSPLKKEDIAKKLKEIAEKEGLNLTE^ : 60 



* 80 * 100 * 120 

ACl INHUMAN : RRAI^ILQSTNMAFGKVTEETVYTCTGHPLKSDIANIU^WM^ : 120 

AF2060 : RKAINALQGAAAIGEVAO^ADTIYQITATARPEEMTELIQTAUCGNFMEARELLDR : 120 

MTH0241 . : RKAINI^QSAASI/;EKITESSIyDWSRARPKD^mKMIKTIUX3KFME^ : 120 

PHBN012 : RRAINILQAAAAU)KKITDENVFMVASRARPEDIREMMLIAUCGNFLKAR : 120 

MJ1422 : RKAINNnL.QTAAALSDVXDDEIVYKVSSRARPEEVKKMMElAIJX^ : 120 



AC 1 INHUMAN : GLALHDILTEIHLFVHRVD : 139 

AF2060 : GMSGEDIVAQLFREIISMP : 139 

MTH0241 : GISGEDMVTQIYQELSRLA : 139 

PHBN012 : GbSGEDVLIQMHKEVFNLP : 139 

MJ1422 : GMSGEDILNQMFREINSLD : 139 



3DOCID: <DE 1 9840771 AlJ_> 
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ZEICHNUNGEN SEUE 12 Nummer: DE 19840 771 A1 

IntClJ: C12N 
Offenlegungstag: 10. Februar 2000 



IntClJ: C12N 



Abbildung 15: 



* 20 * 40 * 

AC15_HUMAN : LLVA/DKYKPTSLKTIIGQQGDQSCANKLU^V«J^QKSSSm : 58 

MJ0884 : LSWVEKyRPKSUa5VAG---HEKVKEKLKTWIESyi^ : 37 

AF1195 : MLWVEKYRPKTLEEWA DKSIITRVIKWAKSWKRG : 35 

MTH0240 : MSWrEKYRPCSFDEWG NQKVIAEIKEWIKAWKAGKP : 37 

PHBN013 : VPWIEKYRPRKLSEIVN QEQALEKVRAWIESWLHGNPPK : 39 



60 * 80 * 100 * 

AC15_HUMAN : SSFKAALLSGPPGVGKTTTASLVCQEIXYSYVELNASiyn^SKSSLKAIVAESL^^ : 116 

MJ0884 : --PKPILLVGPPGCGKTTIAYAlANDyGFEVIE12aASDKra«SSAIKKWGHAATSSS^ : 93 

AF1195 : --SKPLLIAGPPGVGKTSIJaAlAiroiGWEAVEIJ^SDQRSra : 91 

MTH0240 : --QKPLLLVGPPGTGKTTIJVHIIGKEFS-DTLEIJ«ASDRRSQDALMRSAGEASATRSL : 92 

PHBN013 : --KKALLIAGPPGSGKTTTVYAUVHEYNFEVIELNASDERT^ : 94 



120 * 140 * 160 * 

AC15_HUMAN : KGFYSl^AASSVSTKHALIMDEVIXSMAGNEDRGGIQELIGLIK^HTKIPIIC^ : 173 

KJ0884 : FG KKF-LIVLDEVDGISGKEDAGGVSELIKVIK-KAKNPIILTANDAY : 139 

AF1195 : SDEGEF-LSSRIGKLKLIIIJDEVDNIHKKEDVGGEAALIRLligRCT : 148 

MTH0240 : FN HDUa.IILDEVIX3IHGl^RGGVQAINRIIK-ESRHPMVLTANDPY : 139 

PHBN013 : MG KRRKIIFLDEAI»JIE-"P--SGAPEIAKLID-KARWPIIMAANHYW : 137 



180 * 200 * 220 * 

AC15_HUMAN : HPKIRSLVHYCFDLRFQRPRVEQIKGAMMSIAFKEGLKIPPPAMNEIILGANQDIRQV : 231 

MJ0884 : APSIRSLLPYVEVIQLNPVHTNSVYK\^KKIAEKIX3LDVDDKTUCM : 197 

AF1195 : KLSP-ELRNLCENTOJFKRLTKQQVARVLERIALKEGIKVDI^^ : 205 

MTH0240 : SKM*QSIKPRCR\a2»IUUC\mTSSIAAAXJUtICRAEGIECPDD^^ : 197 

PHBN013 ; E\^K-EIRBRAELVEyKRIJaQRI5VISALVRILKREGITVPKEILTEIAKRSSGDLRAA : 194 



240 * 260 * 280 * 

AC15„HUMAN : LHI^SMWCARSKALTVDQAKADSHRAKKDIKMGPFDVARKVFAAGEET;^ : 289 

b4J0884 : INDLEALALSGDLSYEAAQKLP DRKREANIFDALRYILKTTHYGIATTALMN-- : 249 

AF1195 : INDFQALAEGKEELKPEDVFLT KRTQEKDIFRVMQMIFKTKNPAVYNEAML : 256 

MTH0240 : INDLEAMAEGEERIGEELLKMG EKDATSNLFDAVRAVLKSRJDVSKVREAMR : 248 

PHBN013 : INDLQTIVAGG— YEDAKYVLA YRTJVEKTVFQSI^MVFSSDNAKRAKLALMN-- : 244 



300 * 320 * 340 

AC15_HUMAN : LFFHDYSIAPLFVQENYIHVKPVAAGGDMKKHl^LSRAADSICDGDLVDSQI : 347 

MJ0884 : VDETPDWIEWIAENVPKEyEKPEEVARAFEYLSKADRyLGRVMRRQN--ySF : 300 

AF1195 : LDESPEDVrHWVDEKrLPLEySG-VELVNAyEALSRADIFLGRVRRRQF--YRL : 306 

MTH0240 : VDDDPTLVL.EFIAENVPREYEKPNEISRAYDMLSRADIFFGRAVRTRN--yTy : 299 

PHBN013 : LDMSPDEFlXWVDENIPHMYLKPEEMARAYEAISRADiyiiGRAQRTGN--YSL : 295 



AC15_HUMAN 

MJ0884 

AF1195 

MTH0240 

PHBN013 



360 * 380 * 400 

WSLLPAQAIYASVLPGEl^GYMTQFPTFPSVOXSKHSSTGKHDRrVQDIALHMSLRTY : 405 

WKYATTI^TAGVALSKDEKYRKWTPYSY-PKIFRLLTKTKAEREIimilJCKIGEKTH : 357 

WKYASYU4TVGVQQMKEEPKKGFTRYRR-PAWQMLFQIJlQKR£KrRKIL£^ : 3 63 

VmYASEU^GPGVAlAraKTYRKFNmYTG-SSSFRIUSKTRKQRSLRDSVA^^^ : 356 

wkyaidmmtagvavagtk-kkgfakfyp-pntlkmi:aeskeersirdsiikkimk^^ : 351 



* 420 

AC15_HUMAN : SSKRTVNMDYLSLLR : 420 

MJ0884 : TSSKRAR- FDLQMLK : 371 

AF1195 : LSMRKARTEMFPVIK ; 378 

MTH0240 : ISPKVAI-SMFPYME : 370 

PHBN013 : MSKLEAL-ETMKILR : 365 
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Abbildung 16: 



Nummer 
Int. Cl7: 

Offeniegungstag: 



DE 19840 771 A1 
C12N S/12 

10. Februar2000 



* 20 * 40 * 

PHBN023 : IFEVEDQTDRVKVFLPKDSED-YREALK\a>PDAWAFKGVYSKRG-IFFANRFYLPDV : 56 

PfuORF2 : IFEIEDLTGKVKVFLPKDSED*YREAFKVLPDAWAFKGWSKRG-ILYAN^ : 56 

DPD2_HUMAN : -LVLEDELQRIKLKGTIDVSK --LVTGTVLAVFGSVRDDGKFLVEDYCFADLA : 50 

MTH1405 : IIELEDDTGEISVV\mNENHKLFEKSEKIVRDEWGVHGTKKGR--FWASEIFHPGV : 56 

AF1790 z YIRLEDTTGTITCVATGKNAE VARELLGDEVIGVTGLLKGS--SLYANRIVFPDfV : 53 

MJ0702 : IVRIEDTEDEATLILPKEKIEAGKIPDDILLDEVTIGAIGTVSKSGSSIYVDEIIRPAL : 58 

60 * 80 * 100 * 

PHBN023 : PLYRK-QKPPLEEKVYAVLTSDIHVGSK--EFCEKAFIKFLEWn^GYVESKEEEE : 111 

PfuORF2 : PLYRR-QKPPLEEK\nrAILISDIHVGSK--EFCEraFIKFLEVinL^GNVETra • 111 

DPD2_Hl)MAN : PQKP--APP-IJOTDRFVLLVSGLGLGGGGGESLLGT-QLLVDWTGQLGDEGEQCSAA : 104 

MTH1405 : PRIQ-*EK---EMDFSVAFISDVHIGSQ--TFLEDAFMKFVICWINGDPGSEEQRSLAA : 107 

AF1790 : PINGNGEK---KRDFYIWLSDTHFGSK--EFLEKEWEMFVRV^ : 106 

MJ0702 : PPKEP-KR--IDEEIYMAFLSDIHVGSK— EFLHKEFEKFIRFLNGDVDNELEEKWS : 111 



PHBN023 

PfuORF2 

DPD2_HUMAN 

MTH1405 

AF1790 

MJ0702 



120 * 140 * 16-6 * 

RIRYL:IAGDWD-GIGIYP-GQYSDLIIPDIFDQYEALANI;LSNVP KHITIFI • 163 

RVKYLIIAGDVVD-GVGVYP-GQYADLTIPDIFDQYEALANiE-SHVP KHITMFI : 163 

HVSRVIUVGNLLSHSTQSRDSINKAKYLTKKTQAASVEAVKML^ : 162 

DVKYLWAGDIVD-GIGIYP-GQEKELLIRDIHEQYEEAARLFGDIR SDIKIVM • 159 

KVKYrVIAGDIVD'GIGVYP-GQEDDLAISDIYGQYEFAASHLDEIP KEIKIIV : 158 

RLKYICIAGDLVD-GVGVYP-GQEEDLYEVDIIEQYREIAMYLDQIP EHISIII • 163 



PHBN023 

PfuORF2 

DPD2_HUMAN 

MTH1405 

AF1790 

MJ0702 



180 * 200 * 220 * 

GPGNHDAARPAIPQPEFYEEYAKPLYKLKmVIISNPAVIRLHGRDFLIAHGRGIEDV • 221 

APGNHDAARQAIPQPEFYKEYAKPIYKLKKAVIISNPAVIRLHGRDFLIAHGRGIEDV : 221 

MPGEFDPTNYTLPQQPLHPCMFPUVTAYSTl^LVraPYQATIIXSVRFI/ST^^ : 220 

IPGNHDSSRIAEPQPAIPEEYAKSLYSIRNIEFLSWPSUVSLDGVRTLIYHGRSFDDM : 217 

SPGNHDAVRQAEPQPAFEGEI-RSLFP-KNVEHVGNPAYVDIEGVKVLIYHGRSIDDI • 214 

SPGNHDAVRPAEPQPKLPEKITKLFNR-DNIYFVGNPCTLNIHGFDTLLYHGRSFDDL • 220 



PHBN023 

PfuORF2 

r>PD2_HUMAN 

MTH1405 

AF1790 

MJ0702 



240 * 260 * 280 * 

VSFVPGLTHHKPGLPM\^LLKMRHLAPTFGGKVPIAPDPE-DLLVIEEVPDLV(»4GHV : 278 

VGSVPGLTHHK?GLPM\^LLKMRHVAPMFGGKVPIAPDPE-DLLVIBEVPDVVHMGHV : 278 

FRYS SMEDHLEILEViTrU^VRHISPTAPDTLGCYPFYKTDPFIFPECPHVYPCGNT - 27 5 

AMSraGLSHERSDLIMEELLEKRHLAPIYGERTPIASEIE-DHLVIDEVPHVLHTGHV • 274 

ISKIPRLSYDBPQKVMEELLKRRHLSPIYGGRTPLAPERE-DYLVIEDVPDILHCGHI • 271 

VGQIRAASYENPVTIMKELIKRRLLCPTYGGRCPIAPEHK-DYLVIDRDIDILHTGHI • 277 



300 * 320 * 

PHBN023 : HVTDTAVYRG VQLVNSATWQAQTEFQKMVNIVPTPGLVPIVD 

PfuORF2 : HVYDAWYRG VQLVNSATWQAQTEFQKMVNIVPTPAKVPWD 

DPD2_HUMAN : PSFGSKIIRGPEDQTVLLVTVPDF-SATQTACLVKTLR-SLACQPISF 

MTH1405 : HINAYKKYKG VHLINSGTFQSQTEFQKIYNIVPTCGQVPVLN 

' HTYGTGFYRG VFMVNSSTWQAQTEFQKKVNLNPMPGNVAVYR 

MJ0702 : HINGYGIYRG WMVNSGTFQEQTDFQKRMGISPTPAIVPIIN 



320 
320 
320 
316 
313 
319 
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Abbildung 17: 



* 20 * 40 * 

DPOD_HUMAN : KVQSYEKEE3)LU)AWSTFIRIMDPDVITGYNIQNFDLPYLISI^QTLKVQTFPFLGR^ : 58 

DPOL.ARCFU : EIILTGDERKIISDFVKLIKSYDPDIIVGVNQDAFDWPYLRKRAERWNIPLD--VG-- : 54 

MTH1208 : r>rEVVEDERELLE3U^AEIVIDKKPDILVGyNSDNFDFPyiTKRAAILGAELD--I^ : 54 

PHBT047 : YVE\A^SSEREMIKRLIRVIKEKDPDVIITYNGDNFDFPYLUaiAEKIX; : 54 

MJ0885 : NIE\AnCNEKELIKKIIETU(Ey--DVIYTYNGDNFDFPYIJCARAKiyGIDrN--I/5-- : 52 



60 * 80 * 100 * 

DPOD_HUMAN : AGLCSNIRDSSFQSKQTGRRDTKWSMVGRVQMDMLQVLLREYKLRSHTLNAVSFHFL : 116 

DPOL_ARCFU : RDGSNWFRGG RPKITCRLNVDLyDIAMRISDIKIKKLE^A^AEFLG : 100 

MTH1208 : WTCSKIRTORRG-FANATAIKGTraVDLYPVMRRYMNUJRYTLERVYQELF - 104 

PHBT047 . : . RDNSEPKMQKMG-DSLAVEIKGRIHFDLFPVIRRTINLPTYTLEAVYEAIF : 104 

MJ0885 : -KDGEELKIKRGG-MEYRSYIPGRVHIDLYPISRRLLKLTKYTLEDVVYNLF : 102 



120 * 140 * 160 * 

DPOD_HUMAN : GEQKE-DVQHSIITDLQNGNDQTRRRIAWCUCDAYLPU^ - 173 

DPOL_ARCFU : TKIEIADIEAKDIYRYWSRGE--KEK\a^ARQDAINTYLIAK--ELLPMHYE^ : 154 

MTH1208 : GEEKI-DLPGDRLWEYWDRIDEL-RDEIJTlYSIX^DWATHRl)^--KILPIJ«.ELTR^ • 158 

PHBT047 : GKPKE-KVYADEIAKAWETGEG-LERVAKYSMEDAKSnrYE - 158 

MJ0885 : GrEKL-KIPHTKIVDYWAiaND---KTLIEYSLQDAKYTyKIGK--YFFPLEVMF^ • 154 



180 * 200 * 220 * 

GVPLSYLLSRGQQVKWSQLLRQAMHEGLLMPWKSE GGEDYTGATVIEPLK : 225 

RLPVDDVTRMGRGKQVDWLLLSEAKKIGEIAPNPPE HABSYEGAFVLEPER • 205 

GQPLFDISRMATGQQAEWFLVRKAYQYGELVPNKPSQSDFSSRRGRRAVGGYVKEPEK • 216 

GQPVVmvSRSSTGNLVEWFLLRKAYERNEIJ^Pl^PDEKEYERRU^ : 216 

NQTPFEITRMSSGQMVEYLimRAFKENMrVPNKPDEEEYRRRVLTTYEGGYV^ • 212 



DPOD.HUMAN 

DPOL_ARCFU 

MTH1208 

PHBT047 

MJ0885 





240 




DPOD_HUMAN 


: GYYDVPIATLDFS : 


238 


DPOL ARCFU 


: GLHEN-VACLDFA : 


217 


MTH1208 


: GLHEN-IVQFDFR : 


228 


PHBT047 


: GLWEG-IVSLDFR : 


228 


MJ0885 


. GMFED-IISMDFR : 


224 
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Abbildung 18: 



* 20 * 40 * 60 

AF1722 : DTIKGWGMTSKTKIPERLEKGIl^VKHGVFVFKTCTARFDATO^ 
MJ1630 ; GDVKCIKGMTSKQKIVEPLEKAILRAII^IEVYVPKrXSTTRFD^ 
PfuORF3 : DKLKGWGMTSGWKIAEPLEKGLIJRAKNE\rmraGTIRFDATDAPITHFRP 
MTH1536 : DEIKGVEGMISAEKFPEPLEKGILRAKND\^TFKDATIRHDSTDLPLTH^^ 
PHBN021 : DKLKGVMGMTSGWKMPEPLEKGLLRAKND\m^PKIX5TIRFDATDAPrTHF^ 



63 
63 
63 
53 
63 



* 80 * 100 * 120 

AF1722 : EI^YERDYKGAELKNENQIVELKPQIWILPKSGAEYLUIVANFIDDIXVK^^ : 126 

MJ1630 : ELGYDKDIYGiraiVreEQVVEUCPQrn^IIPESCAEyFVKVANFID^^^ : 126 

PfuORF3,: ELGYTHDFEGKPLVSEDQIVEUCPQDVILSKEAGKYLUIVARFVDDLLEK^ : 126 

MTH153 6 : EU3YTRDCyGDELEDEDQILELRVQDWISEDCADYLVRVANFVDDU.ER^ : 126 

PHBN021 : EUSYTHDFEGNPLVSEDQIVELKPQDIILSKEAGKyUUCVAKFVDDLLEK^ : 126 



* 140 * 160 * 

AF1722 : EDLIGHLVIGLAPHTSAGVLGRIIGFSDVLAGYAHPYFHAAKRR :-170 
MJ1630 : EDLIGHLVIGMAPHTSAGMVGRIIGYTKANVGYAHPYFHAAKKR t 170 
PfuORF3 : EDLIGHLVIGLAPHTSAGIVGRIIGFVDALVGYAHPYFHAAKRR\ 170 
MTH1536 : EDLVGHLIAGLAPHTSAAVIiGRIIGFTCASACYAHPYFHSAKRR ? 170 
PHBN021 : EDLIGHLVIGLAPHTSAGIVGRIIGFVDALVGYAHPYFHAAKRR : 170 



Abbildung 19: 



* 20 * 40 * 60 

ELQVINQMGFPGYFLIVMEFIQWSKDNGVPVGPGRGSGAGSLVAYAUCITDLDPLEFDLL : 60 

ELQVINQMGFPGYFLIVMEFIQWSKDNGVPVGPGRGSGAGSLVAYALKITDIiDPLEFDLL : 60 

ELSVIIGMGFEGYFLrSAtfDFIKFAHDNDIPVGAGRGSGAGSIVAYALRITDIDPUCYl^ : 60 

EIEVITNMKFPGYMLIVVroFIRYAKEMGIPVGPGRGSAAGSLVAFAUCITDro : 60 

ELEVINKMGFAGYFLXVQDFIimAKKNDIPVGPGRGSAGGSLVAYAIGITDVDPIKHGFL : 60 



DP3A_EC0LI 

DP3A_SALTy 

BB0579 

PP3A_HELPy 

AA50 



* 80 * 100 * 120 

0P3A„ECOLI : FERFIJ^PERVSMPDFDVDFCMEKRDQVIEHVADMYGRDAVSOIITFGT^^^ : 120 

DP3A_SALTy : FERFUNPERVSMPDFDTOFCMEKRI)QVIEHVADMYGRDAVSQIITPGTM^^ • 120 

BB0579 : FERFI^PERISMPDFDIDPCFEGRDEIIKYVTNKYGEDKVAQIITPGTLKPKAVVKDVAR ■ 120 

DP3A_HELPY : FERFIJIPERISMPDIim)FCQRRRKEIIEYMIEKYGKYWAQVITFNKh!^^ : 120 

AA50 : FERFU^PERVSMPDID\mFCQDNREKVIEYVR^^CyGHDN^ : 120 



DP3A_EC0LI : VLGHPYGFVDRISKLIPP : 138 

DP3A_SALTY : VLGHPYGFVDRISKLVPP : 138 

BB0579 : VLDIPFAESNELTKFIPD : 138 

DP3A_HELPY : VLDMPYKEADDFAKLIPN : 138 

AA50 : AMGLPYSTADKLAKLIPQ : 138 
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